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Preface 


The subjects of relativity, gravitation, and cosmology are frequently discussed areas of 
modern physics among the broad audience. Numerous popular books on these subjects 
are available for the general reader who has no background in physics. At the same 
time, a number of textbooks and monographs for advanced graduate students and young 
researchers in the area are available. 

However, intermediate books, allowing senior undergraduate students or junior PhD 
students to enter this exciting area of physics in a smooth and pedagogical way, are rare. It 
is generally assumed that new concepts of both physics and mathematics involved in the 
subject are too complex to allow a simple and pedagogical introduction. 

The only way to master the subject seems to be to exert a hard and time-consuming 
effort to put together the intelligible pieces of advanced textbooks into an understandable 
set of notes, as we and many of our colleagues have done. This approach has the advantage 
of making young, capable students who persevere through such a learning process well 
trained for future research tasks. However, at the same time, it severely limits the number 
of people who ever really master the subject. 

This book is an attempt to bridge the gap between a regular university curriculum, 
consisting typically of courses in calculus and general physics, and the more advanced 
books in tensor calculus, relativity, and cosmology. The book has evolved from a set 
of lecture notes originally compiled by one of the authors, M. Dalarsson, but has been 
improved and completed with a number of exciting new topics over the past 20 years. 

It is the authors’ intention to give readers a high level of detail in derivations of all 
equations and the results. The more lengthy and tedious algebraic manipulations are, 
in general, outlined in such detail that they can be followed by an interested senior 
undergraduate student or a junior PhD student with little or no risk of getting lost. 

It is our experience that common showstoppers for a young university student trying 
to master a subject are phrases in the literature claiming that something can be derived 
from something else by some “straightforward, although somewhat tedious, algebra.” If 
a student cannot readily reproduce such a “straightforward” algebra, which most often is 
the case, the usual reaction under the time pressure of the studies is to accept the claim as 
a fact. From that point, throughout the rest of the course, the deeper understanding of the 
subject is lost. 

Compared to the first edition (Elsevier Academic Press, 2005), the present edition has 
two new chapters (14 and 22) that have been added, and the chapters from 14 to 24 from 
the first edition have consequently been renumbered. The new Chapter 14 deals with 
some well-known phenomena from the special theory of relativity (time dilatation, length 


xi 


xii Preface 


contraction, addition of velocities, and the twin paradox). The present book focuses on the 
mathematics of relativity and cosmology. These topics were omitted in the first edition as 
being more descriptive and phenomenological. However, based on the strong feedback 
received from readers of the first edition, both students and instructors, we decided to 
include an introductory chapter on these phenomena in this new edition for the sake of 
completeness. 

The new Chapter 22 deals with an important topic of gravitational waves, ft was omitted 
in the first edition based on our assumption that a thorough understanding of a chapter 
on gravitational waves would require a reader already have an advanced mathematical 
knowledge of some wave theory (for example the electromagnetic wave theory) . At the 
same time, we wanted the present book to be as self-contained as possible, so that a reader 
would not have to refer to any other sources in order to understand the entire contents of 
the book. 

Needless to say, in both new chapters, we preserve the same style of presentation 
as in the rest of the book. In other words, we still outline all the lengthy and tedious 
algebraic manipulations in such detail that they can be followed by an interested senior 
undergraduate student or a junior PhD student with little or no risk of getting lost. Last, 
but not least, we have made a number of minor improvements and corrections of a few 
typos in the chapters that appeared in the first edition, 

There are a number of advanced books on relativity, gravitation, and cosmology, and we 
have benefited from some of those, as well as from unpublished notes produced by some 
of our distinguished colleagues. Some of these sources are listed in the bibliography at the 
end of the book, as well as a few other books that may be recommended as suitable further 
reading. However, in an introductory book such as this one, it is impossible to include an 
extensive list of all original references, major textbooks, and monographs on the subject 
or to mention all the people who have contributed to our understanding of this exciting 
subject. 

We hope our readers will find that we have, at least partly, fulfilled the objective 
of bridging the gap between the regular university curriculum and the more advanced 
literature on this exciting subject, and that they will enjoy reading this book as much as we 
did writing it. 


M. Dalarsson and N. Dalarsson 
Stockholm, Sweden 



Introduction 



The tensor calculus is a mathematical discipline of relatively recent origin. It is fair to say 
that, with few exceptions, the tensor calculus was developed during the twentieth century. 
It is also an area of mathematics that was developed for an immediate practical use in 
the theory of relativity, with which it is strongly interrelated. Later, however, the tensor 
calculus has proven to be useful in other areas of physics and engineering such as classical 
mechanics of particles and continuous media, differential geometry, electrodynamics, 
quantum mechanics, solid state physics, and quantum held theory. Recently, it has been 
used even in electric circuit theory and some other purely engineering disciplines. 

In the early twentieth century, at the same time when the tensor calculus was de- 
veloped, a number of major breakthroughs of modern science were made. In 1905 the 
special theory of relativity was formulated, then in 1915 the general theory of relativity 
was developed, and in 1925 the quantum mechanics took its present form. In the years 
to come quantum mechanics and special theory of relativity were combined to develop 
the relativistic quantum held theory, which gives at least a partial explanation of the three 
fundamental forces of nature (strong, electromagnetic, and weak) . 

The remaining known fundamental force of nature, the force of gravity, is different 
from the other three fundamental forces. Although very weak on the small scale, gravity 
dominates the other three forces over cosmic distances. This dominance, due to gravity 
being a long-range force that cannot be screened, makes it the only available foundation 
for any cosmology. The other three fundamental forces are explained through particle 
interactions in the hat space-time of special relativity. However, gravity does not allow 
for such an explanation. In order to explain gravity, Einstein had to connect it with the 
geometry of space-time and formulate a relativistic theory of gravitation. For a long time, 
the general relativity was separate from the other parts of physics, partly because of the 
mathematical framework of the theory (tensor calculus), which was not extensively used 
in any other discipline during that time. 

The tensor calculus is today used in a number of other disciplines as well, and its 
extension to other areas of physics and engineering is a result of the simplihcation of 
the mathematical notation and in particular the possibility of natural extension of the 
equations to the relativistic case. 

Today, physics and astronomy have joined forces to form the discipline called relativis- 
tic astrophysics. The major advances in cosmology, including the attempts to formulate 
quantum cosmology, also increase the importance of general relativity. A number of 
attempts have been made to unify gravity with the other three fundamental forces of 
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nature, thus introducing the tensor calculus and Riemannian geometry to the new exciting 
areas of physics such as the theory of superstrings. 

In the first two parts of the book a pedagogical introduction to the tensor calculus 
is covered. Thereafter, an introduction to the special and general theories of relativity is 
presented. Finally an introduction to the modern theory of cosmology is discussed. 



PARTI 


Tensor Algebra 




Notation and Systems of Numbers 


2.1 Introduction and Basic Concepts 

In order to get acquainted with the basic notation and concepts of the tensor calculus, it 
is convenient to use some well-known concepts from linear algebra. The collection of N 
elements of a column matrix is often denoted by subscripts as X\,X 2 , ■ ■ ■ , -Tv- Using a lower 
index i = 1, 2, . . . , N, we can introduce the following short-hand notation: 

Xi (i= 1,2,..., IV). (2.1) 

Sometimes, the same collection of N elements is denoted by corresponding superscripts 
as x 1 ,x 2 , . . . ,x N . Using here an upper index i= 1,2, . . . ,N, we can also introduce the 
following short-hand notation: 

x‘ (i= 1,2,..., AT). (2.2) 

In general the choice of a lower or an upper index to denote the collection of N elements of 
a column matrix is fully arbitrary. However, it will be shown later that in the tensor calculus 
lower and upper indices are used to denote mathematical objects of different natures. 
Therefore, both types of indices are essential for the development of tensor calculus as a 
mathematical discipline. In the definition (2.2) it should be noted that i is an upper index 
and not a power of x. Whenever there is a risk of confusion of an upper index and a power, 
such as when we want to write a square of x\ we will use parentheses as follows: 

x i ■ x‘ = (x'f (i =1,2 N). (2.3) 

A collection of numbers, defined by just one (upper or lower) index, will be called a first- 
order system or a simple system. The individual elements of such a system will be called 
the elements or coordinates of the system. The introduction of the lower and upper indices 
provides a device to highlight the different nature of different first-order systems with the 
equal number of elements. Consider, for example, the following linear form: 

ax + by + cz. (2.4) 

Introducing the labels a* = {a, b, c} and x l = {x,y, z], the expression (2.4) can be written as 
follows: 

3 

aix 1 + a 2 X 2 + a?,x? = (2.5) 

i=i 
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indicating the different nature of the two first-order systems. In order to emphasize the 
advantage of the proposed notation, let us consider a bilinear form created using two first- 
order systems x' andy* (i = 1, 2, 3): 

a\\x l y l + ai 2 x 1 y 2 + «i3X 1 y 3 
+ a 2 ix 2 y 1 + a 2 2 X 2 y 2 + a 2 3X 2 y 2 

+ omxy + «32X 3 y 2 + «33X 3 y 3 
3 3 

= aijx'yi. (2.6) 

i=i j= l 

Here, we see that the short-hand notation on the right-hand side of (2.6) is quite compact. 
The system of parameters of the bilinear form 

fly (i,j = 1,2,3) (2.7) 


is labeled by two lower indices. This system has nine elements and they can be represented 
by the following 3x3 square matrix: 


fill fii2 fil3 
fi2l fi22 #23 
#31 #32 #33 


( 2 . 8 ) 


A system of quantities determined by two indices is called a second- order system. 

Depending on whether the indices of a second-order system are upper or lower, there 
are three types of second-order systems: 

fly, flj, a i] (i,j = 1,2, . . . , AT). (2.9) 

A second- order system in N dimensions has N 2 elements. In a similar way, we can define 
the third-order systems, which may be of one of the following four different types: 

#yfc, a) k , a\, a i]k (i,j= 1,2, ..., N). (2.10) 

The most general system of order K is denoted by 


#ii,i2,...,ijr 0*i, iz> • * * , Ik — 1 , 2 ,... , IV) 


( 2 . 11 ) 


and depending on the position of the indices, it may be of one of several different types. 
The Kth - order system in N dimensions has N K elements. 


2.2 Symmetric and Antisymmetric Systems 

Let us consider a second-order system in three dimensions 

fly (i,7 = 1,2,3). (2.12) 

The system (2.12) is called a symmetric system with respect to the two lower indices if the 
elements of the system satisfy the equality 

ay = aji (i,j= 1,2,3). 


(2.13) 
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Similarly, the system (2.12) is called an antisymmetric system with respect to the two lower 
indices if the elements of the system satisfy the equality 

fly = -cijt ( i,j = 1,2,3). (2.14) 

The equality (2.14) indicates that an antisymmetric second-order system in three dimen- 
sions has only three independent components and that all the diagonal elements are equal 
to zero: 

ajj = 0 (/= 1,2,3). (2.15) 


Thus, it is possible to represent an antisymmetric second-order system in three dimen- 
sions by the following 3x3 matrix: 


0 a\2 «i3 

— «12 0 C?23 

— «13 — fl23 0 


(2.16) 


In general, a system of an arbitrary order and type will be symmetric with respect to two of 
its indices (both upper or both lower), if the corresponding elements remain unchanged 
upon interchange of these two indices. The system will be totally symmetric with respect 
to all upper (lower) indices, if an interchange of any two upper (lower) indices leaves the 
corresponding system elements unchanged. Elements of a totally symmetric third-order 
system with all three lower indices satisfy the equality 


a ijk — a ikj — a jki — a jik — a kij — a kji- (2.17) 

Analogous to the above, a system of an arbitrary order and type will be antisymmetric 
with respect to the two of the indices (both upper or both lower), if the corresponding 
elements change signs upon interchange of these two indices. The system will be totally 
antisymmetric with respect to all upper (lower) indices, if an interchange of any two upper 
(lower) indices changes signs of the corresponding system elements. Elements of a totally 
antisymmetric third- order system with all three lower indices satisfy the equality 


a ijk — a ikj — CLjki — djik — kikjj — akji ■ 


(2.18) 


2.3 Operations with Systems 

Under certain conditions, it is possible to perform a number of algebraic operations with 
systems. The definition of these operations depends on the order and type of the systems. 

2.3.1 Addition and Subtraction of Systems 

The addition and subtraction of systems can be performed only with the systems of the 
same order and same type. The addition (subtraction) of systems is performed in such a 
way that each element of one system is added (subtracted) to (from) the corresponding 
element of the other system (the one with the same indices in the same order). For 
example, the systems A lJ km and B l ]_ m can be added since they are of the same order and 
of the same type. The sum of these two systems is given by 
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4m=4m+4,n’ ^ 

and it is the system of the same order and type as the two original systems. This 
definition is easily extended to addition and subtraction of an arbitrary number of 
systems. 


2.3.2 Direct Product of Systems 

A system obtained by multiplying each element of one system by each element of another 
system, regardless of their order and type, is called a direct product or just a product of 
these two systems. Thus, for example, a product of two first-order systems a 1 and b l is a 
second- order system 

c ij = a‘b>. (2.20) 


For i,j = 1, 2, 3 this operation can be written in the following matrix form: 



a 1 ~ 


a l b l 

a l b 2 

filfi 

a 2 

[ b 1 b 2 fa 3 ] = 

a 2 b l 

fib 2 

fib 3 

a 3 

a^b 1 

fib 2 

fib 3 


(2.21) 


In general, the set of upper (lower) indices of a system, created as a product of several 
other systems, is a collection of all upper (lower) indices of all of the constituent systems. 
For example, we have 

D ikm = (2.22) 


2.3.3 Contraction of Systems 

This operation is applicable to the systems with at least one pair of indices of opposite 
type, that is, at least one upper index and one lower index. The actual pair of indices 
of opposite type is then made equal to each other and a sum over that common in- 
dex is performed. Thus, for example, by contraction of a third- order system a 1} k , we 
obtain 

N 

a 1 = ^ a'j = uj 1 + af 1 - off. (2.23) 

M 

The contraction of a system of order k gives a system of order k — 2, which is easily seen 
from the example (2.23) . The contraction of a mixed second-order system b 1 - gives a zeroth- 
order system 

N 

b = J2b i j = b\+b 2 2 + --- + b% < (2.24) 

1=1 

which is equal to the trace of the matrix [bj]. 
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2.3.4 Composition of Systems 


The composition of systems is a complex operation consisting of a product of two systems 
and a contraction with respect to at least one of the indices of opposite type from each of 
the systems. The product of the two first-order systems a k and hj is a mixed second-order 
system a k bj. By contraction of this system, we obtain the composition of the systems a k 
and hj in the form 

N 

c = = a l b\ + a 2 hz -\ 1 - a N bN ■ ( 2 . 25 ) 

7=1 


For N = 3, the result (2.25) can be written in the following matrix form: 

N 

Y, ai bj = [ a 1 a 2 a 3 ] 

J=i 



( 2 . 26 ) 


2.4 Summation Convention 

At the very beginning of the development of the general theory of relativity, in order to 
simplify derivations of various results and expressions in the tensor calculus, the summa- 
tion convention over the repeated indices was introduced. According to this convention 
when the same index in an expression appears twice, it is understood that the summation 
over that index is performed and no summation sign is needed. Thus, for example, we may 
write 

N 

Y. a.jxi = ajd, ( 2 . 27 ) 

7=1 

N N 

Y, Y. ajkX^x k = aji c x^x k , ( 2 . 28 ) 

;= l fc=i 
N N 

Y Y D jk n ^ k = D Y^ k - (2 - 29) 

;'=1 fc=l 

The repeated indices, over which a summation is understood, are usually called dummy 
indices. When using the summation convention, following rules should be kept in mind: 

1 . It is required to know exactly which range of values all indices can take. If nothing else 
is specified, it is assumed that all indices in one expression or equation cover the same 
range of integers. 

2 . When it is required to represent any of the three diagonal elements a \ , a\ or a\, in 
order to avoid confusion with the summation convention, a ^ (M = 1, 2, 3) could be 
used instead of a™ (m = 1, 2, 3). In such a case the capital letters are only used for this 
purpose and are otherwise not used as indices. 
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3 . In order to avoid confusion, whenever there are two or more pairs of dummy indices 
in the same expression, they should always be denoted by different letters and never 
by the same letters. 


2.5 Unit Symmetric and Antisymmetric Systems 

The unit symmetric system, called the 5-symbol, is the symmetric second-order system, 
defined as follows: 


ill, for i = j, 
J \ 0, for i / j. 


(2.30) 


Thus, the 5-symbol 5j for (i,j = 1, 2, 3) is a system of nine elements such that the diagonal 
elements, with indices i and j equal to each other, are equal to unity while the off-diagonal 
elements are equal to zero. The matrix of this system is the following: 



1 0 0 
0 1 0 
0 0 1 


(2.31) 


The 5-symbol is often called the substitution operator, since by composition with the 5- 
symbol it is possible to change the index label of any system. For example, it is possible to 
write 


kjCii = sja\ + sjci 2 + 5y£Z3 = clj. (2.32) 

The validity of the result (2.32) is obvious from the definition of the 5-symbol (2.30). Since 
5^ = 1 for M = 1, 2, 3 the trace of the 5-symbol in three dimensions is given by 

5j = 5} + 5| + 5| = 3. (2.33) 

The use of the 5 -symbol may be illustrated by the following example. If a system of three 
independent coordinates is given by x 1 = { x , y, zj, these coordinates, by definition, satisfy 


dx‘ _ I 1, for i -j 1 
dij ~ jo, for i^j j 


(2.34) 


Equation (2.35) can be simplified using the 5-symbol as follows: 


dx‘ 

diJ 


8j- 


(2.35) 


Another important system is the unit antisymmetric system, that is, the totally antisym- 
metric third-order system in three dimensions, called the e-system. It is denoted by e^ k or 
e,yfc and it is defined for ( i,j , k = 1, 2, 3) in the following way: 


e 


ijk 




+ 1, if ijk is an even permutation of 123, 
— 1, if ijk is an odd permutation of 123, 
0, if i = j, i = k,j = k, or i = j = k . 


(2.36) 
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Let us now write down all 3! = 6 permutation of 123 in order 

(123) zeroth permutation, 

(132) first permutation, 

(312) second permutation, 

(321) third permutation, 

(231) fourth permutation, 

(213) fifth permutation. 

The three cyclic permutations (123), (312), and (231) are even permutations and the 
corresponding e-symbol elements are given by 

e i23 = e 312 = e 231 = 1. (2.38) 


Similarly for the odd permutations the e-symbol elements are given by 

g 132 = e 321 = g 213 = _ h 


(2.39) 


and all other elements of the e-symbol are equal to zero. This indicates that the 
e-symbol has only 6 non-zero elements out of the total of 27 elements. Since the e-system 
in three dimensions is a third-order system it is not possible to represent it by a two- 
dimensional matrix, but a three-dimensional scheme is required. However, for better 
visibility a following schematic representation is also possible: 


jk 

11 

12 

13 

21 

22 

23 

31 

32 

33 

i= 1 

0 

0 

0 

0 

0 

1 

0 

-1 

0 

i= 2 

0 

0 

-1 

0 

0 

0 

1 

0 

0 

i= 3 

0 

1 

0 

-1 

0 

0 

0 

0 

0 


The most general Nth order e-symbol is defined as follows: 

1 +1, i\ h ■ ■ ■ in an even permutation of 12 • • • N, 

— 1, i\h ■■■ In an odd permutation of 12 N, (2.41) 

0, any pair of indices equal to each other. 


Using the e-symbols it is possible to write down the expression for the determinant of a 
matrix [aj], for (i,j = 1, 2, 3), as follows: 


a = det 


M 


— pktnn ^-.1 -y2 ^3 


(2.42) 


In order to verify that the expression (2.42) is indeed equal to the determinant of the matrix 
[aj], we use the expression for the determinant of a 3 x 3 matrix as follows: 


a = det 


a\ a \ 
a\ at; a 2 
a 3 a 3 a\ 


(2.43) 


a = a\(a 2 a 3 — a^af) — a^(afa 3 — a 3 a 2 ) + a^(afa 3 — a\a%) 

ioq 193 193 193 193 193 

= ala^a^ — aja^a^ + a^a^a^ — a\a\a\ + a^a^al — a^afa^. 


(2.44) 
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From the definition of the e-symbol (2.36) we see that the result (2.44) can be written in 
the form 

a = e kmn a\c^ n c^ l , (2.45) 

which is identical to (2.42). 

The determinant of a matrix in a general case when (i,j= 1, 2, . . . , N), has the form 

a = e l " 2 '" lN a\a\---a^. (2.46) 

The definition (2.46) clearly indicates the advantage of the system notation, since the 
expression for the determinant of the N x AT-matrix in the expanded form, even for the 
relatively small values of N, is quite complex. 



Vector Spaces 


3.1 Introduction and Basic Concepts 

In general, a mathematical space is a set of mathematical objects with an associated 
structure. This structure can be specified by a number of operations on the objects of 
the set. These operations must satisfy certain general rules, called the axioms of the 
mathematical space. 

In order to specify the operations and axioms used to define a vector space, it is first 
important to introduce some general concepts. The set of values (a 1 , a 2 , ... , a N ) of some 
N variables ( x l ,x 2 , . . . ,x N ) is called a point. A set of all such points for all possible real 
values of the given N variables is called a real /V-dimensional space. A vector in such a 
space is an ordered pair of points, such that it is specified which point is the first point 
(the origin of the vector) and which point is the last point (end of the vector). For example, 
if the point P(a l , a 2 , . . . , a N ) is the origin of the vector and the point Q(b l ,b 2 , . . . , b N ) is the 
end of the vector, then the vector PQ is the vector of displacement from the point P to the 
point Q. The system of values 

c 1 = b 1 — a 1 ,c 2 = b 2 — a 2 , . . . , (A = fA — cP , (3.1) 

or in a more compact notation 

c i = b i -a i (i = 1,2, ... ,1V) (3.2) 

is called the system of coordinates of the vector PQ in the AT-dimensional space. 

The vector PQ in the AT-dimensional space is thus fully determined when we know all 
of the N coordinates of the point of origin P, that is, 

a j (i= 1,2,..., IV), (3.3) 

and all of the coordinates of the vector PQ, given by (3.2). However, in many cases only the 
coordinates (3.2) are used to specify a vector and it is assumed that they can be measured 
from an arbitrary point in the N- dimensional space as an origin. Such a vector is called a 
free vector. An arbitrary vector PQ = C will, therefore, usually be identified with the system 
of coordinates 

c ! ' ( 2 = 1,2,..., IV). (3.4) 

If we adopt a common point of origin for all vectors, defined in the AT-dimensional space, 
then the position of every point in that space is determined by its position vector with 
respect to the adopted common point of origin. 

The point with coordinates (0, 0, ... , 0) is called the origin of coordinates. It is custom- 
ary to adopt it as the common point of origin for all vectors in an AT-dimensional space, 
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although it is not mandatory. The equality of two vectors a and b requires the equality of 
all of their components 

a' = b' (i = 1,2,..., AT). (3.5) 


3.2 Definition of a Vector Space 

A general N- dimensional space in which the operation of addition (subtraction) is defined 
by the equation 

c i = a i ±b i (i =1,2 N), (3.6) 

and the operation of scalar multiplication is defined by the equation 

a‘ = kb 1 (3.7) 

for an arbitrary scalar X, is called the vector space in N dimensions. The two vector 
operations satisfy the following axioms. 

1 . The commutativity of the addition 

a' + b‘ = b' + a' (i = 1,2 N) (3.8) 


2 . The associativity of the addition 

(a' + b‘) + c' = a' + ( b ' + c') (i = 1, 2, . . . , AT) (3.9) 

3 . The existence of a null- vector 0' such that 

a' + 0' = 0 ,: + a' (i=l,2,...,N) (3.10) 

4 . The distribution laws for scalar multiplication 


X(a‘ + b ') — Xa l 

+ Xb l 

(i= 1,2,. 

• • , AT) 

(3.11) 

(X + v)a l = Xa 1 

+ va' 

(i = 1,2,. 

...N) 

(3.12) 


5 . The associativity of the scalar multiplication 

(Xv)a' = \{va!) (i =1,2 N) (3.13) 

6. The existence of a unit-scalar 1 which satisfies 

\a l = a 1 (i= 1,2,..., AT). (3.14) 


The above definition of the AT-dimensional vector space implies certain properties of 
objects in it. The co-linear or parallel vectors in the Af- dimensional vector space are the 
vectors a and b which are linearly dependent, that is, such that 


a 1 = Xb l ( i = 1,2,.. . , AT). 


(3.15) 
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A straight line in the vector space is defined by the equation 

x‘ = a ' + kb' (i = 1, 2, . . . ,1V), (3.16) 

where k is the variable parameter. The vector a determines one point on the straight line, 
while the vector b determines the set of coefficients of the direction of the straight line. A 
plane in the vector space is defined by the equation 

x‘ = a' + kb' + vc' (i — 1,2, .. . , AT), (3.17) 

where k and v are variable parameters. The vector a determines one point on the plane, 
while the vectors b and c are the direction vectors of the plane. 


3.3 The Euclidean Metric Space 

If the distance between two arbitrary points P{a l ,a 2 , . . . , a N ) and Q{b l ,b 2 , . . . , b N ) of a 
vector space is defined by the equation 

5 = PQ = jSfo 1 — a 1 ) 2 + ( b 2 — a 2 ) 2 4 4- — a w ) 2 J ^ (3.18) 

such a vector space is called the Euclidean metric space. In the case of two infinitesimally 
close points P(y l ,y 2 , . . . ,y N ) and QCy 1 4- d y\y 2 4- dy 2 , . . . ,y N + dy N ), the distance ds is 
given by 

ds 2 = (dy 1 ) 2 4- (dy 2 ) 2 4- • • • 4- (djA) 2 , (3.19) 

or 

ds 2 = Si c jdy k dyl (k,j = 1,2,... ,N). (3.20) 

The expression (3.20) is called the square of the line element ds or the metric of the 
Euclidean metric space. The Euclidean metric space is a special case of the general vector 
space. Thus, the operations of addition (subtraction) and scalar multiplication, satisfying 
the axioms (3.8)— (3.14), are defined in the Euclidean metric space. However, in a metric 
space there is another operation with vectors called the scalar product, defined as a 
composition of vectors a) and tf, that is, 

= a 1 !) 1 4- a 2 b 2 4- • • • 4- a N b^ . (3.21) 

The Euclidean metric space in N dimensions is usually denoted by En. 


3.4 The Riemannian Spaces 

In the previous section, we have concluded that in an Euclidean metric space En, the 
metric is defined by 


ds 2 = &kjdy k dyi {k,j = 1, 2, . . . , N) 


(3.22) 
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where y k are Descartes rectangular coordinates. If, instead of the Descartes coordinates 
y k , we introduce N arbitrary generalized coordinates x k , by means of the equations 

y k = y k (x 1 ,x z ,...,x N ) (fc= 1,2,..., IV), (3.23) 


then, due to the relations 


dy fc = dx m (k,m = 1,2 N), 

j dxtn 


the metric can be written as follows: 

" k 3 y k 3 y 

' 3x m 3x" ^dx™dx" 

If we introduce a notation 

dy k dyi 

8mn ~ Skj dx™dx"’ 

the metric can be written in the form 

ds 2 = g mn dx m dx" (m, n = 1, 2, . . . , AT). 


ds 2 = hi— dx" ! — dx" = % — — dx m dx". 


(3.24) 


(3.25) 


(3.26) 


(3.27) 


From (3.26) it is clear that g m n, in general, is not equal to the 8 -symbol and that (3.27) 
cannot be reduced to the sum of squares of differentials of N coordinates. Thus, (3.27) 
is a general homogeneous quadratic form. Let us now, as an example, consider the 
usual three-dimensional Descartes system of coordinates y k = \x,y,z\ and the system of 
spherical coordinates x k = {r, 6, cp), defined by the equations 

y 1 = x 1 sinx 2 cosx 3 , 

y 2 = x 1 sinx 2 sinx 3 , (3.28) 

y 3 = x 1 cosx 2 . 

The components of the system g mn in spherical coordinates are obtained using the 
definition (3.26) as follows: 



= (sinx 2 cosx 3 ) 2 + (sinx 2 sinx 3 ) 2 + (cosx 2 ) 2 = 1, 



= (—x 1 sinx 2 sinx 3 ) 2 + (x 1 sinx 2 cosx 3 ) 2 = (x 1 sinx 2 ) 2 . (3.29) 


Similar considerations show that 


gmn = 0 for m jz n. 


(3.30) 
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Thus, the metric equation (3.27) becomes 

ds 2 = (dx 1 ) 2 + (x 1 ) 2 (dx 2 ) 2 + (x 1 sinx 2 ) 2 (dx 3 ) 2 , (3.31) 

or using the usual notation for the spherical coordinates 

ds 2 = dr 2 + r 2 dd 2 + r 2 sin 2 6 dip 2 , (3.32) 


which is the well-known expression for the metric in spherical coordinates. 

The expression (3.31) or (3.32) is a homogeneous quadratic form and it is not a sum of 
squares of differentials of the three coordinates. 

In general a space of N dimensions, in which a metric is defined with respect to the 
Descartes system of rectangular coordinates (3.22), does not cease to be an Euclidean 
metric space when the metric is expressed with respect to some other generalized system 
of coordinates (3.27) and is no longer a sum of squares of differentials of N coordinates. 
The space remains the same Euclidean metric space and only the system of coordinates is 
changed. 

Let us now consider a case when the Descartes coordinates y k (k = 1,2,..., N ) can be 
expressed in terms of a number M (M < N ) of arbitrary variables x“ (a = 1,2,..., M) 

y k =y k (x 1 ,x 2 ,... t x M ) (k = 1,2 N). (3.33) 


In such a way we define an M-dimensional subspace, denoted by Rm, embedded in the 
original IV-dimensional Euclidean metric space £jv- The metric of the Euclidean metric 
space has the form 

ds 2 = S kj dy k dyi (k,j = 1, 2, . . . ,7V), (3.34) 

and due to the relation 

dy k = — dx", (3.35) 

J dx a 

the metric of the subspace Rm can be written in the form 

ds 2 = g a p dx" dx p (a, p = 1, 2, . . . ,M), (3.36) 


where 

3 y k dyi 

^~ Sk J a&W 


(3.37) 


In (3.37) and in the following text the Greek indices run from 1 to M while the Latin indices 
run from 1 to N. 

Thus, the square of the distance between two infinitesimally close points in the 
subspace Rm is defined by the homogeneous quadratic differential form (3.36) . Let us now 
consider the M-dimensional subspace Rm independently of the original N - dimensional 
Euclidean metric space E/v, in which it is embedded. Now we would like to know whether 
it is possible to find M independent variables and use them as coordinates, such that 
the metric form (3.36) can be written as the sum of squares of differentials of these 
coordinates. If this is possible, the M-dimensional space Rm is in its own right an 
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Euclidean metric space. If this is not possible the M-dimensional space Rm is called the 
Riemannian space. 

Let us, for example, consider the points on a sphere of a unit radius in the three- 
dimensional Euclidean metric space £ 3 , defined by the Descartes rectangular coordinates 
y l = {x,y, zj. The metric has the usual form 

ds 2 = S kj d y k dyi ( k,j = 1, 2, 3). (3.38) 

Let us now define a two-dimensional subspace R 2 of the Euclidean metric space £ 3 , 
in which the position of the points on the unit sphere is specified by the polar angles 
jc “ = {6, <pj. The relations ( 3 . 33 ) in this case have the form 

y 1 = sinx 1 cosx 2 , 

y 2 = sinx 1 sinx 2 , (3.39) 

y 3 = cosx 1 . 

The components of the system g a p are obtained using the definition ( 3 . 37 ) as follows: 



= (cosx 1 cosx 2 ) 2 + (cosx 1 sinx 2 ) 2 + (— sinx 1 ) 2 = 1 



= (— sinx 1 sinx 2 ) 2 + (sinx 1 cosx 2 ) 2 = (sinx 1 ) 2 
gi 2 = £21 = 0. (3.40) 

Thus, we obtain the metric of the subspace £2 in the form 

ds 2 = (dx 1 ) 2 + (sinx 1 ) 2 (dx 2 ) 2 , (3.41) 


or 

ds 2 = d6 2 + sin 2 6 dip 2 . (3.42) 

It turns out that it is not possible to find two real variables { u, v} such that the metric (3.42) 
can be written in the form 

ds 2 = dw 2 + dv 2 . (3.43) 

Thus, the surface of the unit sphere, as a two-dimensional subspace £2 of the original 
Euclidean metric space £ 3 , does not have the internal Euclidean metric. In other words, 
there are no Euclidean coordinates on the surface of the unit sphere. 

It can, therefore, be concluded that in an Euclidean metric space there are some 
subspaces with a non-Euclidean metric. Such subspaces are called the Riemannian spaces. 
In principle, the metric geometry can be generalized by defining the metric of a space in 
advance by choosing the functions g mn in an arbitrary way, with only requirements that 


Chapter 3 • Vector Spaces 19 


the system g mn is symmetric and doubly differentiable. The metric does not even have to 
be positively defined. 

Thus a space, with a metric that is not positively defined but the components of the 
system g mn are constants, is sometimes called the pseudo -Euclidean space. Analogously, 
a space that is not positively defined and the components of the system g mn are arbitrary 
functions of coordinates, is sometimes called the pseudo -Riemannian space. 




Definitions of Tensors 


4.1 Transformations of Variables 


Tensors, as mathematical objects, were originally introduced for an immediate practical 
use in the theory of relativity. The main subject of the theory of relativity is the behavior 
of physical quantities and the laws of nature with respect to the transformations from 
one system of coordinates to another. Therefore, it was important to introduce a new 
class of mathematical objects which are defined by their transformation laws with respect 
to the transformations from one system of coordinates to another. Such mathematical 
objects are called tensors. The systems, that we call tensors, have linear and homogeneous 
transformation laws with respect to the transformations of coordinates. In order to define 
the main types of tensors, let us consider an arbitrary transformation of variables x k into 
some new variables z k , defined by 

z k = z k (x 1 ,x 2 xA) (k=l,2,...,N). (4.1) 


From (4.1) we may write 


d z k = — dx” 
dx m 


(. k , m= 1,2, .. . , TV). 


(4.2) 


The differentials of the coordinates d z 1 are linear and homogeneous functions of the 
differentials of the old coordinates dx m . The differentials of coordinates are by definition 
treated as the components of a special type of tensors, called contravariant vectors. 

Let G be a zeroth-order system. The system of N quantities dG/dx l is transformed 
according to the transformation law 


9G 

a? 


dx m 9G 
~dzFdx™ 


( k , m 


1,2 N). 


(4.3) 


The components dG/dz k are linear and homogeneous functions of the components 
dG/dx m . The components of the system dG/dx m are by definition treated as the compo- 
nents of a special type of tensors, called covariant vectors. 


4.2 Contravariant Vectors 

Generally speaking, any system of quantities, defined with respect to the systems of 
coordinates {x k } and {z k \ by N quantities A k and A k respectively, which is transformed 
according to the transformation law 
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Z 1 

dx m ~ 


- k dz k „„ 
Ac = A 


( k , m = 1,2, ... ,N) 


(4.4) 


is called a contravariant vector. The contravariant vectors are always denoted by one upper 
index. 


4.3 Covariant Vectors 

On the other hand, any system of quantities, defined with respect to the systems of 
coordinates {x k } and {z k } by N quantities B k and B k respectively, which is transformed 
according to the transformation law 

dx m 

B k = —rB m (k, m = 1, 2, . . . , N) (4.5) 

9z fc 

is called a covariant vector. The covariant vectors are always denoted by one lower index. 


4.4 Invariants (Scalars) 

Let us now form a zeroth-order system by composition of one contravariant and one 
covariant vector 


F=A m B m (m = 1,2,... ,1V), 


(4.6) 


with respect to a system of coordinates {x k }. Then, with respect to a new system of 
coordinates {z k } this system has the value 


9z fc . m dx n 


dz k 9x" 


F = A B k = A — r B n = r A B n , 

dX m dZ k dx m dZ k 


(4.7) 


where the upper bar over the quantities F, A k and B k denotes that they are defined with 
respect to the new coordinates {z k } 

On the other hand, by definition of the 5-symbol, we have 


dz k dx n , 

r = 5m (C m,n= 1,2, ... , N ), 

dx m dz k m 


(4.8) 


and the result (4.7) becomes 


F = CA m B„ = A m B m , 


(4.9) 


or 

F = A k B k = A m B m = F. (4.10) 

The result (4.10) shows that the quantity F has the same value in all systems of coordinates. 
Such a quantity is, therefore, called an invariant or a scalar. A composition of one 
contravariant and one covariant vector is therefore called the scalar product since it 
behaves as a scalar with respect to an arbitrary transformation of transformation of 
coordinates. 


Chapter 4 • Definitions of Tensors 23 


4.5 Contravariant Tensors 


Let us consider a system of N 2 products of components of two contravariant vectors B m 
and D n , denoted by 

A mn = B m D n t ( 4 . 11 ) 


with respect to a system of coordinates {x k }. In some other system of coordinates \z k ) this 
system will have values in accordance with (4.4), that is, 


A jk = y D k = ^ t B ’n D n = A n 


dX m dX n 


dX m dx' 1 


( 4 . 12 ) 


In analogy with (4.12), any second-order system, defined with respect to the systems of 
coordinates {x k } and {z k \ by N 2 quantities A'”” and A! k , respectively, which is transformed 
according to the transformation law 


A jk _ dzJ dz yn n 
~ dx m dx n 


( 4 . 13 ) 


is called a second-order contravariant tensor. The second-order contravariant tensors are 
always denoted by two upper indices. 

A third-order contravariant tensor is a system of N 3 quantities, denoted byA mn P, which 
is transformed according to the transformation law 


A ‘Jk _ 


dz l dzJ d Z k 
dx m dx n dxP 


( 4 . 14 ) 


Analogously with the definitions (4.13) and (4.14), it is possible to define contravariant 
tensors of arbitrary order. By convention, any contravariant tensor is denoted by a number 
of upper indices only. 


4.6 Covariant Tensors 

A system defined with respect to the systems of coordinates {x fc } and {z k } by N 2 quantities 
A mn and A jk , respectively, which is transformed according to the transformation law 

dx m dx n 

Aik= JJdz* mn (4J5) 

is called a second-order covariant tensor. Second-order covariant tensors are always 
denoted by two lower indices. 

A third-order covariant tensor is a system of N 3 quantities, denoted by A mnp , which is 
transformed according to the transformation law 


- _ dx m dx n dxP 

Aijk ~UJdw Amnp - 


( 4 . 16 ) 


Analogously with the definitions (4.15) and (4.16), it is possible to define covariant tensors 
of an arbitrary order. By convention, any covariant tensor is denoted by a number of lower 
indices only. 
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4.7 Mixed Tensors 


Let us consider a system, defined with respect to the systems of coordinates {x fc } and 
{z k } by N 4 quantities A’jff and A 1 ^ respectively, which is transformed according to the 
transformation law 


- ij dz‘ dz! dX? dX 5 
kl dx m dx n dz k dz l ps ' 


(4.17) 


The system A™ n is called a fourth-order mixed tensor, that is, a second-order contravariant 
and second-order covariant tensor. The indices m and n are contravariant indices and 
indices p and .s' are covariant indices. In analogy with (4.17) it is possible to define mixed 
tensors with arbitrary numbers of contravariant and covariant indices. There are two more 
types of fourth- order mixed tensors 


A m. .mnp 
nps > 


(4.18) 


Using the transformation law (4.17) it is easy to construct the transformation laws for 
arbitrary mixed tensors. The contravariant and covariant tensors can, of course, be treated 
as special cases of mixed tensors. 


4.8 Symmetry Properties of Tensors 

A second-order covariant tensor Aj k is called a symmetric tensor if its components satisfy 
the equality 

A jk =A kj (/, fc = 1, 2, . . . ,1V), (4.19) 

and the tensor Aj k is called an antisymmetric tensor if its components satisfy the equality 

A ]k = —A k j (j,k = 1,2, , AT). (4.20) 

Analogously, a second- order contravariant tensor A! k is called a symmetric tensor if its 
components satisfy the equality 

A jk = A kj (j,k=l,2,...,N), (4.21) 

and the tensor A! k is called an antisymmetric tensor if its components satisfy the equality 

A i k = _ A kj (/, k = 1, 2, . . . , N). (4.22) 

It is important to note that the symmetry properties of tensors are independent on 
the coordinate system, in which the tensor components are defined. In other words, a 
tensor which is symmetric (antisymmetric) with respect to one coordinate system remains 
symmetric (antisymmetric) with respect to any other coordinate system. 

In order to show that it is the case, let us assume that an arbitrary second-order con- 
travariant tensor, defined with respect to the coordinate system {x k } by N 2 components 
A mn , satisfies the equality 


A mn =A nm n = 1,2,... ,N) 


(4.23) 


Chapter 4 • Definitions of Tensors 25 


which means that it is a symmetric tensor in the coordinate system {x k }. In some other 
coordinate system {z k }, this tensor is given by the components 


J^jlc _ dzI dz k ' „ 
dX m dX n 


( 4 . 24 ) 


However, by means of an interchange of the dummy indices m ** n, which is just a change 
of notation that does not affect the result of summation, and using (4.23) we obtain 


dz 1 3 z k 


3 z k dz 1 


Ajk _ 04j Mim _ 0-6 0/y Atnn _ Akj 

~ dX n dX m ~ dx m dx" 


( 4 . 25 ) 


which shows that the tensor A jk in the new coordinate system \z k ) is indeed symmetric 
as well. 

In general, if a tensor of an arbitrary order and type is symmetric (or antisymmetric) 
upon interchange of one pair of its indices (both lower or both upper indices) in one 
system of coordinates {x k }, it remains symmetric (or antisymmetric) upon interchange 
of the corresponding pair of indices in any other system of coordinates {z k }. In other 
words, the symmetry properties of tensors are independent on the coordinate system, in 
which the tensor components are defined. 


4.9 Symmetric and Antisymmetric Parts of Tensors 

Let us consider a second-order covariant tensor A mn . This tensor can always be written as 
a sum of one symmetric and one antisymmetric tensor as follows: 


_ 1 1 

A mn - -{A mn +A nm ) + -(A m n-A nm ), 


( 4 . 26 ) 


or 


Amn — A ( mn ) -f- Af mn ]- 


( 4 . 27 ) 


The symmetric tensor defined by the expression 


A(mn) — ^ ( Amn T A nm ) 


( 4 . 28 ) 


is called the symmetric part of the tensor A mn , while the antisymmetric tensor defined by 
the expression 


A[nm] — ^ ( Amn 


" A nm ) 


( 4 . 29 ) 


is called the antisymmetric part of the tensor A mn . 

Analogously to (4.27), it is possible to use an arbitrary non-symmetric third-order 
covariant tensor A mnp to create a totally symmetric and totally antisymmetric part as 
follows: 
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A[mnp\ — } (A rnnp A m pn + A pmn Ap nm + A n p m A nm p ) 


(4.31) 


From (4.30) and (4.31) it is easily seen that any interchange of indices m, n and p leaves 
A(mnp) unchanged, while itreverses the sign oiA[ mnp ]. The expressions (4.30) and (4.31) can 
be rewritten in a more compact form by introducing a special label for the permutations 
of the indices m, n and p, as follows: 


nj(m,n,p) (j = 0, 1,2, 3,4, 5). 


(4.32) 


The components of the system (4.32) are the 3! permutations of the three indices m, n, and 
p, which can be listed as follows: 


jro (m, n, p) = mnp, 
ni(m, n, p) = mpn, 

TZ2(m, n, p) = pmn, 

P P (4.33) 

jT3(m, n, p) = pnm, 

7t4(m, n, p) = npm, 

jt 5 (m, n, p) = nmp. 


Using (4.32) and (4.33), the expressions (4.30) and (4.31) can be rewritten in the following 
more compact form: 


3!— 1 


\mnp) 


= -YA n 

Q1 n j 


o, / .--Jd(mnp)> 

j=0 


(4.34) 


A 


[mnp] 


i 

3! 


3!— 1 

y ' (— 1 )^jzj(m n pi , 
1=0 


(4.35) 


valid for the third-order covariant tensor A mnp . 

The expressions (4.34) and (4.35) are easily generalized to the case of the ATh- order 
non- symmetric tensor, where the totally symmetric and totally antisymmetric parts are 
given by 


^ N \- 1 

A(iii 2 ~i N ) — Y 'idik—iN )’ 

■ 1=0 


(4.36) 


^ N \- 1 

A[iiiz— ii\r] = Yh 
■ 1=0 


(4.37) 


4.10 Tensor Character of Systems 

Sometimes we do not readily know the transformation laws for all systems encountered in 
different expressions. In order to be able to determine whether a given system is a tensor 
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or not, we need some criteria for determination of the tensor character of systems. These 
criteria can be based on the following statement: 

If an expression A k B k is invariant with respect to the coordinate transformations and 
we know that A k transforms as a contravariant vector (or that B k transforms as a 
covariant vector), then we know that the system B k is a covariant vector (or that the 
system A k is a contravariant vector). 


In order to prove the above statement, let us begin with the assumption that the 
expression A k B k is an invariant, that is, 

A!Bj - A k B k = 0. (4.38) 

If we then know that 7^ is a contravariant vector, we know that it transforms as 

A> = — ^ . (4.39) 

dx k 

Substituting (4.39) into (4.38) we obtain 

^A k Bj - A% = i^- k Bj - B^j = 0. (4.40) 

As the equality (4.40) is valid for an arbitrary contravariant vector A k , we have 


or 


—Bj-B k = 0, (4.41) 


Bj = -^-B k . 
1 dzi 


(4.42) 


The expression (4.42) is the transformation law for a covariant vector B k , and it shows that 
B k is indeed a covariant vector. In the same way it can be shown that, if the expression 
AmnB m D n is invariant with respect to the coordinate transformations and we know that 
B m andD' ! are two different contravariant vectors, then A mn is the second-order covariant 
tensor. This is valid even if, as a special case, the expression A mn B m B n is an invariant for 
an arbitrary contravariant vector B k , provided that it is known that A mn also satisfies the 
condition of symmetry 


A m n — A nm (m, n — 1,2,..., AT). 


(4.43) 


These conditions can be generalized to tensors of an arbitrary order and used as the 
criteria for determination of the tensor character of systems. 

As an example of the above rules, let us consider a mixed second-order system <5j. Let us 
take the numbers <5j as the coordinates of a second- order mixed tensor with respect to an 
arbitrary coordinate system {x fc }, which is always possible. The question is whether they 
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will keep their values, that is, whether they will remain the coordinates of a S -symbol, after 
the transformation to some new coordinate system {z k }. Thus, we may write 


m dz l dx n 

g' = g m 

1 " dx m dzi 


or, since {z k } is a system of mutually independent coordinates, we have 


(4.44) 


3 z l dx m 3 z l / 

£ l ^ l 

i ~ dx m dzi ~ dzi ~ i' 


(4.45) 


The result (4.45) shows that the 8 -symbol is indeed a second-order mixed tensor, which 
has the same coordinates in all coordinate systems. 


Relative Tensors 


5.1 Introduction and Definitions 


The tensors defined in Chapter 4 are sometimes also called the absolute tensors, since 
there are systems of quantities which, upon the transformations of coordinates, transform 
according to the similar but somewhat more general laws. Such systems are called relative 
tensors or pseudotensors. Thus, a fifth-order system, three times contravariant and twice 
covariant 


hjhf (ii, i 2 , 13,71,72= 1 , 2 , ...,1V) 


( 5 . 1 ) 


is defined as a relative tensor or pseudotensor of weight M, if it transforms according to 
the transformation law 


4 *1*2 *3 

nh 


w M A , mm2m3 dz^_ dz*_ dz^_ dx^_ dx^_ 
dz s ni " 2 9x m i dx m2 dx m2 dzh dzh ' 


( 5 . 2 ) 


In (5.2) we may introduce a notation 



( 5 . 3 ) 


for the Jacobian of transformation of the original coordinates {x k } into the new coordinates 
{z k }. In analogy with the definition (5.2), it is possible to define the relative tensors of an 
arbitrary weight, order, and type. The concept of relative tensors includes the absolute 
tensors, defined in Chapter 4, as a special case. The absolute tensors can be treated as 
relative tensors of weight zero. 

In particular, the relative tensors of weight M = + 1 are called the tensor densities, while 
the relative tensors of weight M = — 1 are called the tensor capacities. 


5.2 Unit Antisymmetric Tensors 


As an important example of relative tensors, we consider the e-symbol in three dimensions 
with three upper indices, that is, 

e ijk (i } j, k — 1 , 2 , 3 ). ( 5 . 4 ) 


If we assume that e'i k is a relative tensor of an unknown weight M, then it transforms 
according to the transformation law 


dX n 


dz m 


dX l dxl dx k 


( 5 . 5 ) 
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On the other hand, by definition of the Jacobian of the transformation A, we have 


dx n 

dz™ 


dx 1 

dx 1 

ax 1 




dZ 2 

dx 2 

ax 2 

3 

dx 2 

dX l 

dxi 

dx k 

dz 1 

dx? 

a? 

ax 5 

a? 

ax 5 

= 

d Z 2 

az 3 

dz 1 

aF 

a? 





(5.6) 


The Jacobian of the inverse transformation is given by 


A’ 1 = 


dz n 


dx n 


= e' 


,ijk 


dZ 1 dZ 2 9 Z 3 
dX l dxj dx k 


Let us now, for the moment, consider the following system: 


(5.7) 


A rst = e i]k a r a s a t_ (5 . 8 ) 

for an arbitrary mixed system a ™ . In the expanded form this system looks like 

A rst = a^ai^a^ — a^a^aj, + a^ala^ — a 2 a s 2 a\ + a 2 a s 3 a[ — a^ala^. (5.9) 

From (5.9), we see that the system A rst is fully antisymmetric system with respect to 
its three indices, in the space of three dimensions. However, an arbitrary third-order 
antisymmetric system in three dimensions has only one independent component, that 
is, A 123 , while the other five non-zero components are determined by the conditions of 
full antisymmetry. Thus, we may write 


j^rst ^123 ^rst 


(5.10) 


In other words, any fully antisymmetric third-order system in three dimensions is propor- 
tional to the corresponding e-symbol. Using (5.8) and (5.10) we obtain 

eWa^ajal = e nt e i i k a]a 2 a i k . (5.11) 

Using (5.11) we may write 


eijk dz^dz^dz<_ = e „, e y fc azi az2 az3 
dX l a xi dx k dx‘ dxi dx k ' 


(5.12) 


or using (5.7) 


e p3 z^az^ 
dx l dxj dx k 


e rst A~ l . 


(5.13) 


Substituting (5.13) into (5.5) we obtain 

- e rst = A M e rst A - 1 . 


(5.14) 


From (5.14) we see that if we choose M—+ 1, the components of the system e rst remain 
unchanged upon the coordinate transformations. Thus, the system e'i k is a third- order 
relative contravariant tensor with the weight M = + 1 (tensor density) . The transformation 
law of the unit antisymmetric system is therefore given by 


dx n 


dz m 


_ Ae i jk dJid^—. 

dx l dxj dx k dx 1 dxj dx k 


(5.15) 
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In a similar way, we can consider the e-symbol with three lower indices in three 
dimensions 


eijk U,j,k= 1 , 2 , 3 ). 


( 5 . 16 ) 


Again, if we assume that ep is a relative tensor of an unknown weight M, then it transforms 
according to the transformation law 


Srst = 


dx 11 


dz m 


M dx 1 dxi dx k 

eijk dFdF~dF' 


Using an analog of (5.11) in the form 

eij k a l r d s a k t = e rst e ijk a[ci! 2 a^, 

we obtain 

dx 1 d X-i dx k dx 1 d xi dx k 

e ‘ jk dF dF FF ~ erste ‘i k FFdFFF’ 


or 


dx‘ dxi a X k _ 

e ‘ Jk dFdF FF ~ l ' rstA ' 


Substituting (5.20) into (5.17) we obtain 


erst = A M erst a . 


( 5 . 17 ) 

( 5 . 18 ) 

( 5 . 19 ) 

( 5 . 20 ) 

( 5 . 21 ) 


From (5.21) we see that if we choose M = —1, the components of the system e rst remain 
unchanged upon the coordinate transformations. Thus, the system ep is a third-order 
relative contravariant tensor with the weight M = — 1 (tensor capacity) . The transformation 
law of the unit antisymmetric system ep is therefore given by 


@rst 


dx n 1 dx 1 dxi dx k _j dx 1 dxi dx k 

d F" eijk dFdF~dF~ e ‘ jk dFdF~dF' 


( 5 . 22 ) 


In general, the transformation law of an arbitrary tensor is defined by 

1 . order — the number of indices; 

2. type — the position and order of indices; 

3 . weight — the exponent of the Jacobian of transformation. 


5.3 Vector Product in Three Dimensions 

Let us consider two vectors of the same type, for example, two contravariant vectors 
denoted by A 1 and B l , in a three-dimensional space ( i — 1,2,3). Using the relative tensor 
ep, we can define a first-order system 

Ci = e ijk AiB k (i,j, k = 1,2,3), 


( 5 . 23 ) 
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which is a first-order covariant relative vector with the weight M = — 1, since it is defined 
as a composition of one relative tensor of weight M = — 1 with two absolute vectors. The 
expression (5.23) can be expanded into three equations 

Cl = a 2 b 3 — a 3 b 2 , 

C 2 =A 3 B 1 -A l B 3 , (5.24) 

C 3 =A l B 2 - A 2 B 1 . 


The transformation law for the vector Q is given by 


C i = A- 1Cp ^, 
1 p dzi 

where the Jacobian of the transformation is given by 

dx n 

A = 


a z m 


(5.25) 


(5.26) 


In a similar way we may define a relative contravariant vector C l of the weight M = + 1, 
starting with two absolute covariant vectors Aj and Bj as well as the relative tensor e ijk , 
that is, 

C = e ijk AjB k ( i,j , lc = 1,2, 3). (5.27) 

The expression (5.27) can be expanded into three equations 

C 1 = A2B3 — A3B2, 

C 2 =A 3 B 1 -A 1 B 3 , (5.28) 

C 3 = A\B 2 — A 2 B\. 


The transformation law for the vector C l is given by 

O = A Cp|^. (5.29) 

dxP 

The product (5.23) or (5.27) is called the vector product of two contravariant or two 
covariant vectors in three dimensions, respectively. This definition includes the usual 
definition of the vector product of two vectors (assumed to be the position vectors) given 
by their Descartes rectangular coordinates. 


C=AxB= 


12 3 

A\ A 2 A 3 
B\ B 2 B 3 


(5.30) 


In the definition (5.30), 1, 2, and 3 are the unit vectors of the three mutually orthogonal 
axes of the Descartes coordinate system. It should be noted that, in the Descartes 
coordinate system, the contravariant and covariant coordinates of the vector product have 
the same numerical values, that is, 


d - 8> k C k . 


(5.31) 
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5.4 Mixed Product in Three Dimensions 

The mixed product of three contravariant vectors Ah /T , and CJ is formed by a composition 
of the vector product 

Dj = e ]km B k C m O', k, m = 1, 2, 3), (5.32) 


with the vector Ah It has the form 

V = A!Dj = e jkm A i B k C m (j, k, m = 1, 2, 3) 


or in the Descartes coordinate system 


V = 


A 1 A 2 A 3 
B 1 B 2 B 3 
C 1 C 2 C 3 


(5.33) 


(5.34) 


From the definition (5.33) it is evident that the zeroth-order system V is a relative 
invariant of the weight M = — 1, or the scalar capacity, since it is composed of three ab- 
solute contravariant vectors and the tensor capacity ej km . The transformation law for this 
system is 

V=A~ 1 V. (5.35) 


In the similar way, the mixed product of three covariant vectors A ; , Bj, and Cj is formed 
by composition of the vector product 

D i = e i km B k C m ( fc, m = 1, 2, 3), (5.36) 


with the vector Ay, and it has the form 

G = AjD J = e jkm AjB k C m ( j , fc, m = 1, 2, 3) 


or in the Descartes orthogonal coordinates 


G = 


Ai A 2 A 3 
B\ B 2 B 3 
Ci C 2 C 3 


(5.37) 


(5.38) 


From the definition (5.37) it is evident that the zeroth-order system G is a relative 
invariant of the weight M = +1, or the scalar density, since it is composed of three 
absolute covariant vectors and the tensor density e jkm . The transformation law for this 
system is 

G = AG. (5.39) 


Unlike absolute scalars, the systems V and G, as relative scalars, in general change with 
respect to the transformations of coordinates. The behavior of these systems with respect 
to a special class of the orthogonal transformations of Descartes coordinates will be 
discussed in the following section. 
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5.5 Orthogonal Coordinate Transformations 

In order to highlight some important properties of the vector products and mixed prod- 
ucts in three dimensions, we will consider the orthogonal transformations of Descartes 
coordinates rotation, translation, and inversion. 


5.5.1 Rotations of Descartes Coordinates 


Let us observe two Descartes coordinate systems K and K' with a common z- axis, denoted 
by 3 = 3', perpendicular to the plane of the paper. The system K' is obtained as a result of 
a rotation in the positive sense of the system K about a common 3-axis for some angle 9, 
as shown in Figure 5.1. 

The relation between the coordinates of a position vector of a given fixed point P, with 
respect to the coordinate systems K' and K, is given by 


z 1 " 


cos 9 sin0 0 


'x 1 ' 

z 2 

= 

— sin# cos 9 0 


X 2 

z 3 


0 0 1 


X 3 


z k =A k xi (j,k= 1,2,3). 

The transformation (5.41) is a linear transformation with the facobian 


A = 


dz k 

dxi 



= 1 . 


(5.40) 


(5.41) 


(5.42) 


Thus, the Euclidean metric, given by the analog of (3.18) in three dimensions, is invariant 
with respect to the rotations of the Descartes coordinates. This can easily be shown by 



FIGURE 5.1 Rotations of Descartes coordinate systems. 
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using the distance between the point P(x l ,x 2 ,x 3 ) from the origin 0(0, 0, 0), which in the 
coordinate system K is given by 

S = [(x 1 ) 2 + (x 2 ) 2 + (x 3 ) 2 ] 1/2 . (5.43) 

In the coordinate system K' the distance between these points is given by 

S' = [(z 1 ) 2 + (z 2 ) 2 + (z 3 ) 2 ] 1/2 . (5.44) 

Substituting (5.40) into (5.44) we obtain 

S' = ^(x 1 cos0 + x 2 sing) 2 + (— x 1 sin$ + x 2 cos#) 2 + (x 3 ) z j ^ (5.45) 


or 


S' = S. 


(5.46) 


Thus, the distance between the points in the Descartes coordinate system is invariant with 
respect to the rotation of coordinates about the 3-axis. The metric form of the space is also 
invariant with respect to the rotation of coordinates about the 3-axis, that is, we have 

ds' 2 = ds 2 . (5.47) 

It is easily shown that the above results are valid for arbitrary rotations of coordinates in 
the Descartes coordinate systems. 


5.5.2 Translations of Descartes Coordinates 


Let us consider two Descartes coordinate systems K and K' in three dimensions, where 
the coordinates in the system K' are denoted by {z k \ and the coordinates in the system K 
are denoted by {x fc }. The translation of the coordinates is given by the equation 

z k = x k + a k (fc= 1,2,3), (5.48) 


where a k is some constant translation vector. The Jacobian A of this transformation is 
given by 


A = 


3 z k 

3x7 



(5.49) 


Thus, the metric form of the three-dimensional space in Descartes coordinates is invariant 
with respect to the translation of the coordinate systems, since we have 


dz k = dx k (fc= 1,2,3). 


From (5.50) we have 

ds' 2 = Sjk d d d z k — Sjk dxi dx fc = ds 2 (j,k = 1,2, 3), 


(5.50) 


(5.51) 


which proves the invariance of the metric form with respect to the coordinate translations. 
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5.5.3 Inversions of Descartes Coordinates 

Let us again consider two Descartes coordinate systems K and K' in three dimensions, 
where the coordinates in the system K' are denoted by [z k ] and the coordinates in the 
system K are denoted by {x k }. The Inversion of the coordinates is given by the equation 

z k = - X k = -S k x j ( k = 1, 2, 3). (5.52) 

It can be shown that the inversion of Descartes coordinates cannot be achieved by means 
of any rotation of the original coordinate system K. The metric form is invariant with 
respect to the inversion, since we have 

ds' 2 = Sji- dz J dz k = Sjk d(— x k ) d(— x k ) = fy*. djd dx k = ds 2 . (5.53) 

The Jacobian of this transformation is equal to 

-Sf | = -1. (5.54) 



5.5.4 Axial Vectors and Pseudoscalars 

Thus, as a conclusion of this section, we note that rotation, translation, and inversion 
constitute a group of orthogonal transformations which leave the metric form invariant. 
Furthermore, the transformation laws of relative vectors and scalars (e.g., vector products 
and mixed products in three dimensions), with respect to rotations and translations of 
coordinates, are the same as those for the absolute vectors and scalars. However, relative 
vectors and scalars (e.g., vector products and mixed products in three dimensions) do not 
transform like absolute vectors and scalars, with respect to the inversion of coordinates. 

As we have concluded before, the transformation law of an absolute contravariant 
vector in three dimensions is given by 

A> = r A k ( j , k = 1, 2, 3), (5.55) 

dX K 

with respect to the coordinate transformations from {x k } to {z k }. An example of this type 
of vectors is the polar vector of the position of a certain point in a three-dimensional 
space. However, the vector product of two absolute covariant vectors Aj and Bj is a relative 
contravariant vector O, which transforms according to the transformation law 

O — A —r C k (j,k= 1,2,3). (5.56) 

dx k 

Substituting (5.42) or (5.49) into (5.56) we see that the vector product is transformed in 
the same way as the absolute vectors (5.55), with respect to rotations and translations. 
On the other hand, the vector product reverses sign with respect to the inversion of 
coordinates and does not transform as the polar vectors. This difference between the 
vector product, as the relative vector, and the polar vectors, being the absolute vectors, 
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was noted in the three-dimensional vector algebra before the development of the Tensor 
Calculus. In the three-dimensional vector algebra, the vector product is called the axial 
vector as opposed to the position vector which is called the polar vector. From the tensor 
point of view this difference is easily understood, since it relates to the definition of the 
position vector as an absolute vector and the vector product as a relative vector. 

The mixed product in three dimensions, as a relative scalar, is invariant with respect 
to rotation and translation but it reverses sign with respect to inversion. In the three- 
dimensional vector algebra these scalars are usually called pseudoscalars. 




The Metric Tensor 


6.1 Introduction and Definitions 


As we have seen in the previous chapters, in the Euclidean metric space it is possible to 
reduce the metric form to a sum of squares of the differentials of the coordinates, that is, 
we may write 

ds 2 = Sjk dyi d y k ( j, k = 1,2,... ,N). (6.1) 

If, instead of the Descartes orthogonal coordinates {y k }, we introduce the arbitrary gener- 
alized coordinates {x k } by means of the equations 

y k = y k (x 1 ,x 2 ,...,x N ) (k = 1,2 N ), (6.2) 


we may write 

dy k = — dx m (k,m= 1,2,..., AT), 

and the metric form can be written as follows: 

ds 2 = g mn dx m dx n (m, n = 1,2,..., AT), 


(6.3) 


(6.4) 


where 

dyj dy k 

gmn - 


(6.5) 


is a symmetric second- order system. The square of the infinitesimal line element ds is 
by definition an invariant in all coordinate systems and dx' n is an absolute vector. Thus, 
using the criteria for the tensor character of systems and the fact that g mn is a symmetric 
system, we conclude that g mn is an absolute second-order covariant tensor. This tensor is 
called the metric tensor. The determinant of the matrix associated with the metric tensor is 
given by 


g — \gmn | — 


, dy j 3 y k 

Is., 1 

dyj 

dy k 


a y k 

Jk dx m dx n 

\°]k\ 

dx m 

dx n 


dx n 


( 6 . 6 ) 


where the multiplication rule for determinants has been used. 

Thus, the determinant of the metric tensor is equal to the square of the Jacobian 
of the transformation from the given Descartes coordinates to the arbitrary generalized 
coordinates. Assuming that the Jacobian of the transformation from the given Descartes 
coordinates to the arbitrary generalized coordinates is a non-zero real number, the 
determinant of the metric tensor is always a positive quantity, that is, we have g > 0. 
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As an example, using the results (3.29), the matrix form of the metric tensor in the 
system of spherical coordinates x k = {r, 6, cp} is given by 


[ gmn ] — 

and the determinant of the metric tensor is given by 


1 0 

0 (x 1 ) 2 


0 0 (x 1 sinx 2 ) 2 


(6.7) 


8 = 


1 0 0 

0 (x 1 ) 2 0 

0 0 (x 1 sinx 2 ) 2 


[(x 1 ) 2 sinx 2 j 2 = (r 2 sin#) 2 


( 6 . 8 ) 


As a system, the determinant g is a relative scalar invariant of the weight M — 2, which 
is easily shown as follows: 


g = 


3 y k 

2 

3 y k 

dx m 

2 

3 y k 

2 

3x m 

2 

= 8 

dx m 

dzi 


dX m 

dzi 


dX m 


dzi 

dzi 


or 


g = g&. 


(6.9) 


( 6 . 10 ) 


which proves that the determinant g is indeed a relative scalar invariant of the weight 
M = 2. From (6.10) we see that x /g is a relative scalar invariant of the weight M — 1, 
that is, 

Vg=VgA- (6.11) 

Let us use G mn to denote the cofactor of the element g mn in the determinant \gmn\- Then 

according to the determinant calculation rules, we have 

g = gmnG mn . (6.12) 

The adjunct matrix to the matrix [g mn \ , which is denoted by adj [gmn ] , is by definition the 
transposed matrix of the co-factors, that is, [(2""] = The matrix, inverse to the 

matrix [g m „], is, therefore, given by 

. r n adj [gmn] [G m "] T 

inv [g mn \ = t i = 1 1_. (6.13) 

8 g 

By definition, a product of a matrix with its own inverse is equal to a unit matrix. Thus, we 
may write 

iT 

(6.14) 


[ r^mny 

[gmn] = 1, 


8 


or using the system notation 


~<nk 


§mn 


— g mn — °m- 

o o 


8 


(6.15) 
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Introducing the notation 


the expression (6.15) becomes 


~,nk 


G nk 

g 


gmng nk 


(6.16) 


(6.17) 


From (6.17), we see that the system g nk is a second-order contravariant tensor, which is 
usually called the contravariant metric tensor. Analogously to the covariant metric tensor, 
the contravariant metric tensor is also a symmetric tensor, that is, we have 

g mn _ gUm ' ( 6 . 18 ) 


Using (6.17) and the multiplication rules for the determinants we find 



(6.19) 


or 


lg'^1 = 1 _ I. 

|gmn| g 


( 6 . 20 ) 


The determinant of the contravariant metric tensor is a relative scalar invariant of the 
weight M = —2, which is easily shown using (6.10) 

1 1 P 

- = — A -2 , (6.21) 

g g 

or 


\g mn \ = \g mn \ A - 2 . (6.22) 

Since the antisymmetric unit system e 11 ' 2 '"‘ N is a relative contravariant tensor of the weight 
M= + 1, it is possible to form an absolute contravariant antisymmetric tensor e !li2 " iJV , 
using yg, as follows: 

ghh—iN _ __ e hh -iN (6.23) 

Vg 

On the other hand, since the antisymmetric unit system e^...^ is a relative covariant ten- 
sor of the weight M = — 1, it is possible to form an absolute unit covariant antisymmetric 
tensor e ili2 ...i N , using yg, as follows: 

e hh-iN = ■sfg^hh—iN ■ (6.24) 


The absolute tensors defined by (6.23) and (6.24) are called the Ricci antisymmetric tensors 
in the space of N dimensions. 


6.2 Associated Vectors and Tensors 

In a metric space, the contravariant and covariant tensors can be transformed to each 
other using the metric tensors g mn and g mn . In general the upper indices can be lowered 


42 TENSORS, RELATIVITY, AND COSMOLOGY 


and the lower indices can be made to be upper indices, using the metric tensors. For 
example, a covariant vector 

Am = gmnA n , (6.25) 

derived from a contravariant vector A n using the metric tensor g mn is called the associated 
vector to the contravariant vector A”. In the same way, a contravariant vector 

A m = g mn A n , (6.26) 

derived from a given covariant vector A n using the metric tensor g mn , is called the 
associated vector to the vector A n . 

For the associated vectors, the following rules are valid: 

1 . The association relation of two vectors is reciprocal. If a vector A m = g mn A n is 
associated to the vector A”, then the associated vector to the vector A m is the vector 
A". This can be shown as follows: 

g nm Am = g nm g mk A k = & n k A k = A". (6.27) 

2 . The absolute square of a contravariant vector A m or a covariant vector A m is the scalar 
(inner) product of a vector and its associated vector, that is, 

A 2 = A m A m = g mn A m A n . (6.28) 

3 . From the above rule it is evident that the absolute squares of the associated vectors 
are equal to each other. 

4 . The scalar product of the vectors A k and B k is equal to the scalar product of the vectors 
A m and B m and it is invariant with respect to an arbitrary coordinate transformation. 
This can be shown as follows: 

A k B k = g km A mgkn B n = 8™A m B n = A rn B m . (6.29) 

The above rules allow us to consider the vectors A m and A m as the covariant and 
contravariant coordinates of the same vector, which we may denote by A. From (6.29) 
we see that, in the metric space, it is possible to define the scalar product of two vectors 
A and B regardless of their type, that is, 

AB= A k B k = A m B m = g mn A m B n . (6.30) 

The analogous rules apply to the tensors of an arbitrary order and type. By composition 
with the metric tensors g mn and g mn , the upper indices are lowered and the lower indices 
are turned to the upper indices respectively. All tensors, created from each other by 
composition with one or more metric tensors, are called associated tensors. Thus, for an 
arbitrary second-order covariant tensor a mn we can create three associated second-order 
tensors a mn , a™, and a" m , as follows: 


Chapter 6 • The Metric Tensor 43 


It should be noted that the tensors a™ and a” v associated to the tensor a mn as defined 
in (6.31), are in general not equal to each other. They are equal to each other only when 
the covariant tensor a mn is symmetric, that is, when a mn = a nm . If we apply (6.31) to the 
metric tensor itself, we first note that the definition of the associated contravariant metric 
tensor turns to identity 

g mn = g mk g nj g kJ = g' nk S n k = g mn . (6.32) 

Secondly, since the metric tensor is a symmetric tensor, there is a unique mixed metric 
tensor equal to the corresponding 5-symbol 

gri — g mk gkn ~ (6-33) 

in agreement with (6.17). 

6.3 Arc Length of Curves: Unit Vectors 

Let us consider a Riemannian metric space in N dimensions, described by a system of N 
generalized coordinates {x ! }. If these coordinates are functions of an arbitrary parameter 
t, then a curve in this Riemannian space may be specified by N parameter equations 

x k = x k (t) (k=l,2,...,N). (6.34) 

The square of an infinitesimal arc length element of the curve between the points x k and 
x k + dx k is given by 

ds 2 = g mn dx'" dx" (m, n = 1, 2, . . . , N). (6.35) 

The infinitesimal arc length element itself is given by 

ds = yjgmn dx' n dx" {m, n = 1, 2, . . ., N), (6.36) 

and its derivative with respect to the parameter is given by 

ds / dx m dx" , 

dt = y gmn ~df^f «n,n = l,2,...,N). (6.37) 

The arc length of a curve from some reference point, with a parameter value of to, to some 
arbitrary point, with a parameter value of t, is then equal to 

rt I dx n 

s(t) = L y 8mn ~dF^r dL (6 - 38) 

From (6.35) we see that we may write 

dx' n dx" - 

g ' nn ~ds — ds~ = Smnk m k n = X m X m = X ■ X = 1, (6.39) 

where X m is a vector defined by the expression 

,» _ dx m 
^ “ “ds" 


(m = 1,2, . . . ,N). 


(6.40) 
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From (6.39) we see that vector X m has the absolute square equal to unity. Thus, the absolute 
value of the vector X m , denoted by |A,| is also equal to unity. 

|A.| = VTI=1. (6.41) 

A vector with the absolute value equal to unity is called a unit vector. The vector X m is a 
unit contravariant vector. As x m are the coordinates of a point on a given curve and s is the 
length of the arc of that curve, the vector X m is the tangent unit vector to this curve. 

6.4 Angles between Vectors 

Let us now consider two polar unit vectors X m and /i m with a common origin and with the 
end points A and B, as shown in Figure 6.1. 

From Figure 6.1. we see that 

e m = - X m (m = 1, 2, . . . , AT). (6.42) 

Using the cosine theorem, the absolute square of the vector e m can be written in the form 
|e| 2 = |/q 2 + |A| 2 — 2 |/x| |A|cos6> = 2- 2cos6> = 2(1 - cos 9). (6.43) 

On the other hand, by definition, the absolute square of the vector e m has the form 

kl 2 = g mn e m e n = gmn(u m - X m )(u n - x n ), (6.44) 

or 

kl 2 = gmnU m U n + gmnX m X n - 2 g mn X m fi n - (6.45) 

Since X and u are unit vectors, we have 

gmnU m U n = gmnX m X n = 1, (6.46) 

and the result (6.45) becomes 

kl 2 = 2(1 -gmnX m fl n )- (6.47) 

Comparing the results (6.43) and (6.47) we find that the cosine of an angle between two 
unit vectors is given by the expression 



FIGURE 6.1 The angle 9 between unit vectors A m and /i m . 
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cos 0 = g mn k m ii n . 


(6.48) 


If we have two arbitrary contravariant vectors A m and B m , then they define two directions 
with unit vectors 


x m 



B n 

~B' 


(6.49) 


The angle between these two directions is, according to (6.48) , given by 


A m B n 

cos G=g mn ——. (6.50) 

An 

From (6.50) we also see that the orthogonality condition for two contravariant vectors 
A m and B" has the form 


g mn A m B n = 0 (m, n = 1,2,.. .,1V). 


(6.51) 


The angle formed by two curves at their intersection is an angle between their tangent 
unit vectors at the intersection point. For two curves given by the parameter equations 
x m = ) and x n = (p n (t), the angle formed by these curves at their intersection point is 

given by 


cose = 



dl/r m d<p n 

Smn df dt 

d \]f m d yjr n [~ d (p m d<p n 

d t 3F” V & mn d t d t 


(6.52) 


where g mn , and are all calculated at the intersection point of the two curves. From 
(6.51) we see that the orthogonality condition for the two curves at their intersection point 
is given by 


gmnf m <P n = 0 (m,n= 1,2 N). 


(6.53) 


6.5 Schwarz Inequality 

Let us consider two arbitrary contravariant vectors, denoted by A m and B m , and let us form 
a linear combination of these two vectors as follows: 

A m + aB' n = C m , (6.54) 

where a is an arbitrary real absolute scalar parameter. The linear combination C m itself is 
also a contravariant vector. The absolute square of the vector C m is given by the positive 
definite form 

|C| 2 = g mn C m C ", 

ICI 2 = g mn (A m + aB m KA n + aB n ), 

\C\ 2 = g m nA m A n + 2 agmn A m B n + a 2 gmnB m B n , 

\C\ 2 = |h| 2 + 2 ag mn A m B n + a 2 \B\ 2 . 

In order that the quadratic form (6.55) is non-negative for all values of the parameter a, 
the following inequality must be satisfied: 
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(g,nnA m B n ) 2 - \A \ 2 \B \ 2 < 0. (6.56) 

Using the notation A = |A| and B = |B|, the inequality (6.56) gives 

\gmnA m B' l \<AB. (6.57) 

The inequality (6.57) is known as Schwarz inequality and it is of importance in a number 
of branches of Mathematics and Physics. 


6.6 Orthogonal and Physical Vector Coordinates 

In this section we consider an important special case when the three-dimensional 
Euclidean metric space is defined by some set of orthogonal curvilinear coordinates 
{x k }, (k = 1, 2, 3). For such a system of curvilinear coordinates, in every point of space the 
following conditions are satisfied: 

gl2 = g23 = g31 = 0. (6.58) 


Thus, we may write 


gmn — (Ul, 11, M — 1, 2, 3), 


(6.59) 


where h\[ = hM(x k ) are some functions of coordinates {x k }. In this case the metric of the 
space can be written in the form 

ds 2 = (hi dx 1 ) 2 + (h 3 dx 2 ) 2 + (h 3 dx 3 ) 2 , (6.60) 


and the matrix of the metric tensor is a diagonal 3x3 matrix 


[gmrc] — 


(hi) 2 0 0 

0 (h 2 ) 2 0 

0 0 (h 3 ) 2 


(6.61) 


Let us now, as an example, consider the vector of generalized velocities in this coordinate 
system. The contravariant coordinates of the velocity vector are given by 


v 


m 


dx"' 

df 


(6.62) 


where dx m are the coordinates of a contravariant polar vector in any system of coordi- 
nates and dt is a absolute scalar parameter. Using (6.61) we can calculate the covariant 
coordinates of the velocity vector, as follows: 

Cm = gmnV n — h^8 mn v n . (6.63) 


As the generalized coordinates do not necessarily have the dimension of length (e.g., 
they can be the angular coordinates), the functions fi M = fi M (x fc ) are not necessarily 
dimensionless (e.g., they may have the dimension of length). From (6.63) follows that the 
dimensions of the contravariant and covariant coordinates of the velocity vector are not 
the same, and they are not the same as the expected dimension of the velocity vector 
(i.e., length/time). The coordinates of the velocity vector that do have the dimension of 
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velocity (i.e., length/time) are called the physical coordinates of the velocity vector. The 
physical coordinates can, in the special case under consideration, be obtained from the 
contravariant or covariant coordinates using the formulae 


V(m) = h M V m = V m . 

h M 


(6.64) 


As an illustration, let us consider the velocity vector in the spherical coordinates, where 

x 1 = r,x 2 = 0,x 3 = <p, (6.65) 


and 


hi = 1 , I 12 = r, h 3 = rsind. 


( 6 . 66 ) 


The contravariant coordinates of the velocity vector in the spherical coordinate system are 
given by 


The associated covariant coordinates 
system are given by 


o do o dip 

ir = 3 -, ir = — . 
df df 


(6.67) 


of the velocity vector in the spherical coordinate 


d/ 9 do 9.9 dw 

vi = — , v 2 = i 2 — v 3 = r 2 sm 2 0-f. (6.68) 

df df df 

From the results (6.67) and (6.68) we see that neither all the contravariant nor the covariant 
coordinates of the velocity vector have the dimension of velocity (i.e., length/time). 

The physical coordinates of the velocity vector in the spherical coordinate system, that 
is, the projections of the velocity vector to the directions of the curvilinear axes in a given 
space point, according to the definition (6.64) have the form 

dr do dip 

*'(!) = ^* y ( 2 ) = ''d7’ y ( 3 ) = rsm0 — . (6.69) 

In a similar way, we can construct the physical coordinates of an arbitrary vector with 
respect to an arbitrary orthogonal system of coordinates in three dimensions. This ap- 
proach can also be extended to an arbitrary N - dimensional space, but it is most commonly 
used in three or sometimes four dimensions. 



Tensors as Linear Operators 


Second-order tensors can be described as linear operators acting on vectors in metric 
spaces. An operator in the AT-dimensional metric space is defined by the way it acts on 
different vectors in the space. For example, in the expression 

B m = 0%A n (m, n = 1,2,.. . ,N), (7.1) 

the tensor O™ represents a linear operator, which in a unique way relates a vector B m to the 
original vector A m . This operator is called a linear operator since it satisfies the conditions 
of linearity and homogeneity, that is, 

0”\A n + B n ) = 0"‘A n + 0" ! £", 

0%(fiA n ) = /30™A n , (7 ' 2) 

where A" and B' 1 are arbitrary contravariant vectors and f J > is an arbitrary absolute scalar. 

Eigenvectors of an operator OJf are defined as those vectors .S’”, which retain the 
direction and only change the absolute value as a result of the action of the operator OJ" . 
In other words for the eigenvectors S n , we have 

0™S n =XS m (m, n = 1, 2, ... ,1V) (7.3) 


or 


(0%-\8%)S n = 0 (m,n= 1,2,..., N). (7.4) 

The system of homogeneous linear equations (7.4) has a non-trivial solution for S n only if 
its determinant is equal to zero, that is, 

det(0™ - XS™) = 0. (7.5) 


The N solutions for the parameter X of the algebraic equation (7.5) are called the eigenval- 
ues of the operator 0™ . Equation (7.5) is sometimes called the secular equation. 

As an example of the concepts defined above, let us consider a tensor OJf, defined in a 
three-dimensional metric space by the matrix 


m = 


1 0 5 
0-2 0 
5 0 1 


(7.6) 


Substituting (7.6) into (7.5) we obtain the equation for the parameter X in the form 


l-x o 5 

0 —2 — X 0 

5 0 1 - it 


= 0 . 


(7.7) 
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By expanding the determinant in (7.7) we obtain 

(1 - A)[(— 2 - !)(1 - M] + 5[ — 5( — 2 - !)] = (1 - MiG + 2)0 - 1)] + 250 + 2)] 

= O + 2) [25 - O - l) 2 ] = 0. (7.8) 

The solutions of (7.8) for the parameter! are the following: 

A + 2 = 0=>A = —2, 

25 — O - l) 2 = 0 =>• X = 1 ± 5. ( 7 - 9) 


Thus, the three eigenvalues of the operator O™ are equal to 

7.5 = —2, Xp = —4, Xq = +6. (7.10) 

The corresponding eigenvectors S n , P n , and Q“ are obtained from the matrix equations 

(7.11) 


(7.12) 


' 1 

0 

5 

"S 1 ' 


'S 1 " 

0 

-2 

0 

s 2 

= -2 

S 2 

5 

0 

1 

_s 3 _ 


_S 3 _ 

' 1 

0 

5 ' 

~ p 1 ' 


'P 1 ' 

0 

-2 

0 

p2 

= -4 

p2 

5 

0 

1 

_ P 3 _ 


_P 3 _ 

' 1 

0 

5 " 

r q 1 1 


'Q 1 " 

0 

-2 

0 

Q 2 

= +6 

Q 2 

5 

0 

1 

Q 3 


Q 3 


(7.13) 


By solving the matrix equations (7.1 1)— (7. 13), we obtain the normalized (unit) eigenvec- 
tors of the operator OJf, as follows: 


[S"] = 


' s 1 ' 


' 0 ' 

s 2 

= 

1 

s 3 


0 


(7.14) 


[P H ] = 


' P 1 ' 

1 

1 

p 2 

0 

p 3 


-1 


(7.15) 


[Q"] = 


r q 1 1 

1 

■ 1 - 

Q 2 


0 

Q 3 

” V2 

1 


(7.16) 


It is easy to show by direct substitution that the eigenvectors (7. 14)— (7. 16) satisfy the 
matrix equations (7.11)— (7.13), respectively. It should also be noted that the eigenvectors 
S n , P n , and Q n , which correspond to different eigenvalues, are orthogonal to each other. 
In other words, it is easy to show by direct calculation that these eigenvectors satisfy the 
orthogonality conditions 
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[S’f [P n ] = [S"] T [Q"] = [P"] T [Q"] = 0, (7.17) 

where the notation M T is used for a transposed matrix of an arbitrary matrix M. Fur- 
thermore, the three eigenvectors are normalized so that their absolute values are equal 
to unity. In other words they are unit vectors of the three independent directions. Thus, 
it can be shown by direct calculation that the eigenvectors satisfy the normalization 
conditions 

[S’f [S' 1 ] = [P' l f [ P n ] = [Q'f [Q"] = 1. (7.18) 

The set of normalized eigenvectors orthogonal to each other is called the set of orthonor- 
mal eigenvectors. 

In general, the eigenvectors of an operator OJJ', corresponding to the different eigen- 
values, are mutually orthogonal, if the tensor O mn — g ink O" is symmetric. It can be shown, 
by using the definitions 


or 


0™s n = X s S m , 
0'”P n = X P P m 


(7.19) 


0™S n P m = g nk O mk S n P m = O mk S k P m = X s S m P m , 

oyp n s m = g nk o mk P n s m = o mk p k s m = \ P p m s m 


Using the symmetry of the tensor O mk , that is, the equality O mk = O km , we can interchange 
the dummy indices m -<-> k to see that the left-hand sides of both of (7.20) are equal to 
each other. Furthermore, we note the equality S m P m = P m S m . With these properties, we 
can subtract the second equation from the first equation in (7.20) to obtain 

0 =(k s -Xp)S m P m . (7.21) 


From the result (7.21) we see that whenever the two eigenvalues are not equal to each 
other, that is, whenever X$ / Xp, the two corresponding eigenvectors are indeed orthogo- 
nal to each other 

S m Pm = gmnS m P n = 0. (7.22) 

The eigenvectors of an operator are in general determined up to an arbitrary multiplica- 
tion constant. However, in most cases it is convenient to have normalized eigenvectors of 
an operator, that is, to use unit vectors as eigenvectors. Thus, we require, as a convention, 
that the eigenvectors should satisfy the normalization conditions 


S m S m = g mn S m S n = 1. (7.23) 

This implies that the eigenvectors of an operator OJ" in the three-dimensional metric 
space (e.g., S m , P m , and Q m above) define three mutually orthogonal directions, provided 
that the corresponding eigenvalues are not equal to each other, that is, X$^Xp^Xq. 
These three directions are sometimes called the main directions of a second- order 


tensor 0" ! . 
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If the roots of the secular equation (7.5) are not distinct, we have a degeneracy. In 
the three-dimensional case, if two roots are equal to each other, the second-order tensor 
has only one main direction and a plane perpendicular to it, where all directions are 
main directions of the tensor. If all three eigenvalues of a second- order tensor are equal 
to each other, such a tensor does not distinguish any particular directions in the three- 
dimensional metric space. In other words, all directions in the three-dimensional metric 
space are the main directions of such a tensor. 


PART 2 


Tensor Analysis 




Tensor Derivatives 


8.1 Differentials of Tensors 


In the curvilinear coordinates the differential of a covariant vector A m , denoted by dA m , is 
not a vector. It is easily shown using the transformation law of covariant vectors (4.5). If a 
covariant vector is defined with respect to the systems of coordinates {x k } and {z k } by N 
coordinates A m and A m , respectively, then it transforms according to the transformation 
law 


Using (8.1) we obtain 


or 


dx n 

A m = — ~ A n (m, n = 1,2 N). 


'dx n , A , /9x"\ 
dAm -^ dAn + d \^) An ’ 


dx n d 2 X n k 

dA m = cL4„ A- A n r d z . 

dz m dz m dz k 


( 8 . 1 ) 


( 8 . 2 ) 


(8.3) 


The result (8.3) clearly shows that dA n does not transform as a vector, except in a special 
case when 


d 2 X n 

dz m dz k 


(8.4) 


that is, in the case of a linear orthogonal transformation of Descartes coordinates into 
some new Descartes coordinates. As the differential of a covariant vector 

dA 

d A m = — — dx" (m, n = 1, 2, . . . ,N) (8.5) 


is a system which is not a covariant vector, while the system dx n is a covariant vector, the 
second-order system is not a tensor. The observation that the differential d A m is not a 
vector is related to the definition of the differential, that is, 

cL4,„ = A m {x k + dx k ) - A m (x k ) (Jc, m = 1, 2, . . . , N) (8.6) 


By definition (8.6), d A m is a difference between two covariant vectors with the origins 
in different, infinitesimally close, points of the metric space. On the other hand, the 
multiplier in the transformation law (8.1) is of the form 


dX n 

dz™ 


= f^X k ), 


(8.7) 
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and it is in general a function of coordinates. The multiplier (8.7) is different in different 
points of the metric space, except in the special case of linear transformations. It is, 
therefore, evident that vectors, in general, transform differently in different points of 
the metric space and consequently the differential (8.6) cannot transform as a covariant 
vector. In order to construct a differential of a covariant vector in curvilinear coordinates, 
which is a covariant vector itself, it is required that both vectors on the right-hand side of 
(8.6) are situated in the same point of the metric space. This can be achieved by moving 
one of the infinitesimally close vectors, entering the right-hand side of (8.6), to the same 
point where the other vector is situated. 

The operation of moving one of the infinitesimally close vectors to the point where 
the other vector is situated must be performed in such a way that in the Descartes 
coordinates the resulting difference is reduced to the ordinary differential d A m . Since d A m 
is a difference of the components of the two infinitesimally close vectors, the components 
of a vector, to be moved to the point where the other vector is situated, must remain 
unchanged. This can only be achieved by a parallel displacement of the vector between 
the two infinitesimally close points, using the Descartes coordinates. 

Therefore, let us consider an arbitrary contravariant vector with the coordinates A m at a 
point of the metric space with coordinates x m . The coordinates of this contravariant vector 
at the point of the metric space with coordinates x m + dx m are denoted by A m + d A m . An 
infinitesimal translation of the vector A m to the point of the metric space with coordinates 
x m + dx m generates a translated vector denoted by A m at the point of the metric space with 
coordinates x m + dx m . Thus, the differential of the contravariant vector A m can be written 
in the form 

dA' n = A m (x k + dx k ) -A m (x k ), 
d A m = A m (x k + dx k )~ A m {x k + dx k ) 

+ A m (x k + dx k ) - A m (x k ), 
d A m = D A m (x k + dx k ) + SA m . 

In (8.8) we have introduced a differential D A m between the two vectors A m (x k + dx fc ) and 
A m (x k + dx k ) situated at the same point x k + dx k of the metric space, which itself behaves 
as a contravariant vector with respect to the coordinate transformations, as follows: 

D A m = A m (x k + dx k ) - A m {x k + dx k ). (8.9) 

Furthermore, in (8.8) we have introduced an increment 8A m , due to the parallel translation 
of the vector A m to the point of the metric space with coordinates x m + dx m , as follows: 

SA m = A m (x k + dx k ) - A' n (x k ). (8.10) 

As 8A m is a difference between the translated vector A m at the point x k + dx k and the non- 
translated vector A m at the infinitesimally close point x k , the system 8A m is not a vector. 
The difference 8A m vanishes in the Descartes coordinates and DA m reduces to d A m , as 
required. 

In order to calculate the increment 8A m , we note that it is a function of the coordinates 
of the contravariant tensor A m themselves. This functional dependence must be linear, 
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since a sum of two vectors must transform according to the same transformation law 
as each of the vectors. Furthermore, it also has to be a linear function of the coordinate 
differentials. Thus, we may write 


8A m = -r™A n dxP, 


( 8 . 11 ) 


where T™ p is a system of functions of coordinates, which are usually called the Christoffel 
symbols of the second kind. Composition with the metric tensor gives the Christoffel 
symbols of the first kind, as follows: 


m,np 


— Smk^ 


np' 


( 8 . 12 ) 


The system Tf p is dependent on the coordinate system and in Descartes coordinates all of 
its components vanish, that is, F™ p = 0. It is therefore clear that F’f p is not a tensor, since 
a tensor which is equal to zero in one coordinate system must remain equal to zero in 
any other coordinate system. In a Riemannian space it is not possible to find a coordinate 
system to satisfy the condition 

r£ =0 (k,n,p = 1,2, ...,N) (8.13) 


in the entire metric space. 


8.1.1 Differentials of Contravariant Vectors 

Substituting the result (8.11) into the definition (8.8), we obtain the result for the differen- 
tial of a contravariant vector A"', as follows: 

a am 

DA' n = cL4 m - SA m = — dx? + F^JF 1 dx p , (8.14) 

dxP n r ’ 

or 

( r)A' n \ T)A m 

+ r "LA" ) dx p = dx p . (8.15) 

dxP n r ) DxP 


8.1.2 Differentials of Covariant Vectors 

In order to derive an expression analogous to the result (8.15) for covariant vectors, let 
us consider an absolute covariant vector A m and an absolute contravariant vector B m . 
The composition of these two vectors gives an absolute scalar A m B m . As the scalars are 
invariant with respect to the parallel translation, we can write 

S(A m B m ) = B m SA m + A m 8B m = 0, (8.16) 

or 

B m 8A m = -A m 8B m = +A„T , f lp B m dx p , (8.17) 

where we have made an interchange of the dummy indices m o- n on the right-hand side 
of (8.17), such that it can be rewritten as 
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B m SAm = B m r» ip A n d lx p . (8.18) 

As the equality ( 8 . 18 ) is valid for an arbitrary contravariant vector B m , we have 

SA m = +T n mp A n teP. (8.19) 


On the other hand, analogously to the case of the contravariant vectors, we may write 


or 


D A m = dA m -&A m = -^. d xP - T n mp A n dxP, (8.20) 

D A m =(^- r n mD A n \ dxP = ^ dxP. (8.21) 

v dxP m r n ) D XP 


8.2 Covariant Derivatives 

8.2.1 Covariant Derivatives of Vectors 


From ( 8 . 15 ), we conclude that in the tensor analysis the differential of a contravariant 
vector d A m , which does not have a tensor character, is replaced by the tensor differential 
DA m . Comparing the results ( 8 . 5 ) and ( 8 . 15 ) we see that, following the same approach, 
we need to replace the partial derivative of a contravariant vector with respect to a 
coordinate, which does not have a tensor character either, by the covariant derivative of 
a contravariant vector with respect to a coordinate, as follows: 


dA m D A m 
~dxP DxP ’ 


( 8 . 22 ) 


where the covariant derivative of a contravariant vector is defined by (8. 15) , as follows: 


DA m 

DxP 



(8.23) 


In the Descartes coordinates, where Y" l p — 0, the covariant derivative reduces to the 
corresponding partial derivative. 

From the result ( 8 . 21 ), we see that the covariant derivative of a covariant vector is 
defined by the expression 


_ dA m _ 

DxP “ dxP m P n 


(8.24) 


The results ( 8 . 23 ) and ( 8 . 24 ) show that the covariant differentiation of both contravari- 
ant and covariant vectors gives the corresponding second-order tensors. In general, by 
covariant differentiation we preserve the tensor character of an arbitrary tensor, but we 
increase its order by one. It should be noted that for both contravariant and covariant 
vectors the order is increased by one additional covariant index. Due to the fact that this 
operation always increases the number of covariant indices of an arbitrary tensor by one, 
the corresponding derivative has been named the covariant derivative. 
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8.2.2 Covariant Derivatives of Tensors 


Let us consider a special case of a second-order contravariant tensor, obtained as a 
product of two contravariant vectors A m and B m , denoted by C mn — A m B n . By parallel 
translation, we obtain 

S(A m B n ) = A m SB n + B n SA m . (8.25) 

Using the definition (8.11), this expression becomes 

8(A m B n ) = -A m T^ p B k AxP - B n r™A k dxP, 

8(A m B n ) = - (r l p A m B k + r£M*F ! ) d j/P, (8.26) 

sc mn = - (r i p c mk + r ™ c to ) d xp. 

Due to the linear character of the operation of parallel translation, the result (8.26) is also 
valid for an arbitrary contravariant tensor C mn . As the differential of a contravariant tensor 
C mn is by definition given by 

DC™ = dC™ - 8C mn , (8.27) 


substituting from (8.26), we obtain 


DC” 


dxP 


_l_ Y n Qmk, _j_ pm ^-jkn 

kp kp 


dxP. 


(8.28) 


From (8.28), the covariant derivative of a second-order contravariant tensor C mn is defined 
as follows: 


DC™ 

DxP 


d C mn 
dxP 


-m f-^mk , pm r^kn 
kp ° 1 kp ° * 


(8.29) 


The same approach can be used for a second-order covariant tensor C mn — A m B n , where 
we may write 


8 (A m B n ) — B n 8A m + A m 8Bn, 

8 (A m Bn) = -t B n r k lp A k dxP + A m V k p B k dxP, 

8(A m B n ) = (v k mp A k B n + r k pA m Bk) dxP, ( 8 ' 30 ’ 

8C mn = (r k mp c kn + r k p c mk ) dxP. 

Again, due to the linear character of the operation of parallel translation, the result (8.30) 
is also valid for an arbitrary covariant tensor C mn ■ As the differential of a covariant tensor 
C mn is by definition given by 


DCm,, — dCmn — 8Cn 


(8.31) 


substituting from (8.30), we obtain 

D C mn = (jgjp - r k mp C kn - r k p C mk ) dxP. (8.32) 

From (8.32), the covariant derivative of a second-order covariant tensor C mn is 
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D Cmn _ dC mn n n fQ OQ , 

DxP _ dxP tnp'-kn 1 np^mk- (.o.oaj 

By simple consideration of (8.29) and (8.33), it is easy to define the covariant derivative of 
a mixed second-order tensor C™, as follows: 


D Cl 

DxP 


dC ; 


dxP 


n , pm pk pA: pm 
.n 1 kp^n 1 np^k * 


(8.34) 


Analogously to the definitions (8.29), (8.33) and (8.34), it is possible to define covariant 
derivatives of tensors of an arbitrary order and type. As an example, let us write down the 
covariant derivative of a mixed third- order tensor C"i 


DC; 


nk 


sc : 


nk 


■ pm^-tf _ p$ pm 

“T" 1 sp^nk 1 np^sk 


p5 pm 
1 pk^ns- 


(8.35) 


DxP dxP 

The example (8.35) clearly represents the general method for construction of the covariant 
derivatives of arbitrary tensors. 


8.3 Properties of Covariant Derivatives 

There is a number of important rules and special cases that apply to covariant differen- 
tiation. These rules and special cases are frequently used in tensor calculations, and the 
most important ones are listed below: 


1 . The covariant derivation is a linear operation and the following linearity condition is 
fulfilled: 


D 

DxP 


0 aA m + /SB'”) = a 


D A m 
DxP 


+ P- 


DB m 

DxP 


(8.36) 


2 . The covariant derivative of a product of two tensors is defined in the same way as the 
usual derivative of a product of two functions. As an example, for the product A m C mn , 
we may write 


D 

DxP 


(A m Cmn) 


D A m 
DxP 


C n 


+ A m 


D('mn 

DxP 


(8.37) 


3 . The covariant derivative of the metric tensor is equal to zero. In order to prove this 
proposition, let us observe that for a covariant vector DA m , in the same way as for any 
other vector, we have 


D A m — g mn DA n . 


(8.38) 


On the other hand, using A m = gmnA' 1 , we may write 

DA m = D(g mn A n ) = gmnDA n + A n Dg mn . (8.39) 

As A" is an arbitrary vector, by comparison of (8.38) and (8.39) we immediately 
conclude that 


D g mn = 0 =► Dg mn = 0. 


(8.40) 
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In other words, with respect to the operation of covariant differentiation, the metric 
tensor can be treated as a constant, regardless of the coordinate dependence of its 
components. 

4 . The covariant derivative of the 5-symbol is equal to zero. In order to prove this 
proposition, we use the above results 

Dg„ fc = 0 Dg mk = 0. (8.41) 


Using (6.17) and (8.41) we may write 

DC = D (g mk g nk ) = g mk Dg nk + g nk Dg mk = 0. (8.42) 

Using (8.34), it is also possible to prove this proposition by direct calculation 


D<5 


n O0 n , pm ok pA; om pm pm r» 

" n ~ dxP 1 kp ° n 1 np ° k ~ 1 np 1 n P ~ U ‘ 


(8.43) 


DxP dxP ' ~ k P 

5 . The covariant derivative of an invariant scalar function is equal to its ordinary partial 
derivative, and we may write 

D cf> _ 90 
DxP ~ dxP 

The partial derivative of a scalar function is, therefore, transformed as a regular 
covariant vector. In order to prove this proposition, let us observe a scalar </> = A m B m , 
where A m is some absolute contravariant vector and B m is some absolute covariant 
vector. The covariant derivative of the scalar function 0 is given by 


(8.44) 


or 


DxP DxP DxP DxP 


D0_ = ' ™ n A k B m +A m —~ T k mv A m B k . 

rtrP HrP k p rnjr K 


DxP dxP K P dxP 

By interchanging the dummy indices f o m,we see that the second term on the 
right-hand side of (8.46) is cancelled by the fourth term, and we obtain 

D0_ _ d _^l B +A m^L = ) = 

DxP dxP dxP dxP dxP 

which proves the proposition (8.44). 


(8.45) 


(8.46) 


(8.47) 


8.4 Absolute Derivatives of Tensors 

Let us assume that a curve C in the N - dimensional metric space is given by means of N 
parameter equations 

x m = x m (s) (m= 1,2, ...,1V), (8.48) 

where 5 is some scalar parameter. Let us now consider a covariant vector^ defined along 
the curve C as a function of the parameter s. 
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The absolute derivative of the vector A m with respect to the parameter s is defined using 
(8.21) as follows: 


D A m 
ds 


3 A n 
~dxP 


~ ^mpAnj 


dxP 

ds 


or 


D A„ 


D A m dx p 


( 8 . 49 ) 


( 8 . 50 ) 


ds DxP ds 

In the same way we define the absolute derivative of a contravariant vector A m , as follows: 

D A m D A m dxP 

( 8 . 51 ) 


ds DxP ds 

The definitions of the absolute derivatives (8.50) and (8.51), for covariant and contravari- 
ant vectors respectively, are easily generalized to the case of an arbitrary tensor. For 
example, the absolute derivative of a tensor is given by 


DC: 


nk 


ds 


DC^dxP 

DxP ds 


where the covariant derivative of the tensor C”l is given by (8.35), that is, 


DC; 


nk 


3 C: 


nk 


■ praps _ pS _ pS pm 

' 1 sp^nk 1 np^sk 1 pk^ns- 


( 8 . 52 ) 


( 8 . 53 ) 


DxP 3 xP 

The properties of the absolute derivative are the same as the properties of the covariant 
derivative and will not be repeated in this section. 


Christoffel Symbols 


9.1 Properties of Christoffel Symbols 


In order to define various tensor derivatives in Chapter 8, we have introduced a system 
r"p, called the Christoffel symbol of the second kind. The objective of this chapter is to 
specify this system and outline its most important properties. 

Let us first prove that the Christoffel symbols are symmetric with respect to their lower 
indices. In order to prove this statement we note that \iA m is an arbitrary covariant vector, 
then the quantity 


I tt,, ; D Ap 
D xP ~ Dx m 


( 9 . 1 ) 


is a second-order covariant tensor. Let us now assume that the vector A m can be given by 
the expression 


A D 4> 90 

m Dx m dx' n ’ 

where 0 is some arbitrary scalar. Then the expression (9.1) can be written in the form 


( 9 . 2 ) 


Dfin 


DA? 


D xP Dx m 


dA m 

~dxP 


or 


DA? 


DA, 


d z 4> 


r mpAk ■ 


3 2 0 


3 Ap 
~dx™ 


-> k 
pm * 


(r k - 1 

l 1 pm 1 mp i 


D xP Dx m dxPdx m dx m dxP 

The first and second terms in (9.4) cancel each other and we obtain 


30 

dx k ' 


DA, 


DA,? 


D xP Dx m 


= (r k - r k ) 

\ 1 pm 1 mp J 


30 
dx k ' 


( 9 . 3 ) 


( 9 . 4 ) 


( 9 . 5 ) 


As the left-hand side of (9.5) is a second-order covariant tensor, we conclude that the right- 
hand side of (9.5), that is, 


/ pi: __ pi: ) ^ (g 5) 

pm m P) Q x k v 1 

is also a second-order covariant tensor. However, we know by definition of the Christoffel 
symbols, that they are identically equal to zero in all points of the Euclidean metric space 
described by the Descartes coordinates. Thus, the second-order covariant tensor (9.6) is 
identically equal to zero in the Descartes coordinates. But a tensor, which is identically 
equal to zero in one coordinate system, must be equal to zero in any other coordinate 
system. Thus, we conclude that 
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D A m D A v 


As the vector A m , given by 


D xP Dx m 


Tn — 


= (r k - r k \ 

I, p m i m p i 


90 

dx k 


= 0. 


D0 90 


(9.7) 


( 9 . 8 ) 


Dx m dx m ’ 

is an arbitrary covariant vector and 0 is an arbitrary scalar function, we conclude from 
(9.7) that 


r pm = r mp (k,m,p=l,2,...,N), 


( 9 . 9 ) 


and the Christoffel symbols of the second kind are indeed symmetric with respect to 
their lower indices. From (9.9) it is evident that, for the Christoffel symbols of the first 
kind, we have 


r n,pm — T n ,mp {M, 11, p — 1,2, , N ) . 


( 9 . 10 ) 


Let us now derive the transformation law of Christoffel symbols with respect to the 
transformations of coordinates. If a contravariant vector is defined with respect to the 
systems of coordinates {x k } and {z k } by N coordinates A m and A m respectively, then it 
transforms according to the transformation law 


Using (9.11) we may write 


dA' n 

~dzP 


f)z m ■ 
- A m = ^ A ]_ 

dx> 


— ( = —JL( — aA , 

dzP \dx! ) dzP dx k \dx) ) 


or, after calculating the derivative in the parentheses 


dA m _ dx k dz m dAi dx k d 2 z m j 
~dzP ~ dzP ~dxf dx k + ~dzP dx k dxj 


( 9 . 11 ) 


( 9 . 12 ) 


( 9 . 13 ) 


The transformation law (9.13) is just a direct confirmation of the fact that the partial 
derivative of a contravariant vector is not a tensor, as we have shown indirectly in 
Chapter 8. On the other hand, the covariant derivative of the contravariant vector is a 
mixed second-order tensor and it transforms according to the transformation law 


DA m _ dx k dz m D N 
DzP ~ dzp-dxf Dxic' 


( 9 . 14 ) 


Here, we recall the definitions of covariant derivatives of contravariant vectors (8.23), in 
the coordinate system {z k } 


D A m 
DzP 



and in the coordinate system {x k }, that is, 


DAJ dAi j , 
Dxk ~ ~^k + n A ■ 


( 9 . 15 ) 


( 9 . 16 ) 
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Substituting (9.15) and (9.16) into (9.14), we obtain 


3 A m dx k dz m dAi dx k d z m j , 

+ r La" = r- + -tLa'. 

’r n 7 p a Y k 3 z p qxj lie 


dzP ' ‘ ,lfr ~ dzP 3 xi dx k 

Substituting (9.13) into (9.17) we obtain 


dx k 3 z m dAi d X ic g 2 z m 


dx k dz m dA J dx k dz m 

~.N + r LLA" = r- — r + ... 

i ’r a 7 p a yj aA 3 Z P dxl lk 


r,,A. 


( 9 . 17 ) 


( 9 . 18 ) 


dzP 3 xi dx k dzP dx k d xi "p dzP dxi dx k 

The first term on the left-hand side cancels the first term on the right-hand side in (9.18), 
and by using 


A' = S\A 1 


A" = ^A 1 , 
dx l 


( 9 . 19 ) 


we obtain from (9.18) 


/ jdX k d 2 Z m - m dZ n \ I dX k d Z m ,■ , 

y 1 dzP dx k dxJ ' ip dx l ) dzP dxl ,k 


( 9 . 20 ) 


As (9.20) is valid for an arbitrary contravariant vector A 1 , this tensor can be omitted. After 
omitting A 1 and multiplying the remaining equation by 


we finally obtain 


dz p dxi 
dX^d Z™’ 


( 9 . 21 ) 


j dxi dz n dzP - dxi 3 2 z m 

ri. = r — r r , + 7 — 7. 9.22 

11 dz m dx l dx k ' lp dz m dx k dx l 

Equation (9.22) is the required transformation law for the Christoffel symbols, and it shows 
that the system rj fc behaves as a tensor only with respect to the linear transformations of 
Descartes coordinate systems, when we have 


d 2 z m 

dx k dx l 

In general, as we have concluded before, the Christoffel symbol is not a tensor. 


( 9 . 23 ) 


9.2 Relation to the Metric Tensor 


For practical calculation of the Christoffel symbol we need to find its relation to the metric 
tensor. In order to find this relation, we start with the identity 


or 


D gmn 

dxP 


dgn 


dxP 


^mpgkn T npgmk 9, 


( 9 . 24 ) 


dgmn 

dxP 


= r 


n,mp 


+ r 


m,np- 


( 9 . 25 ) 
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The permutations of indices in (9.25) give the equation 

dgpn 


dx n 


= r 


m,pn 


"T T p,mn — 0, 


( 9 . 26 ) 


and, with the reversed signs, the equation 

dgnp 


dx m 


— Fppim T n ,pm — 0. 


( 9 . 27 ) 


By adding together (9.25)— (9.27) and using the symmetry property (9.10), we obtain 


m,np 


_ 1 ( dgmn 3 gpm dgnp\ 

dX m ) ' 


dxP + dx n 


( 9 . 28 ) 


The result (9.28) is a definition of the Christoffel symbol of the first kind as a function of 
the metric tensor. The Christoffel symbol of the second kind is then given by 


pm p.mA;p * i ~cwt , 

1 np~ 5 1 k,np ~ 1 “T «„ 


mk 


dgkn , dgpk _ 3 gnp\ 

dxP + dX n dx k ) ' 


( 9 . 29 ) 


The result (9.29) is a definition of the Christoffel symbol of the second kind as a function of 
the metric tensor. Using the definition (9.29), we can calculate the contracted Christoffel 
symbol of the second kind F“ n , as follows: 


r m = - hr 

1 nm o \ o 


r mk 3 gkn 

dx m 


+ g‘ 


mk 


dgmk 

dx n 


~g‘ 


mk dgnm 


dx k 


( 9 . 30 ) 


Due to the symmetry of the metric tensor and the symmetry of the Christoffel symbol with 
respect to the interchange of its lower indices, the first term in the parentheses of (9.30) 
cancels the third term, and we obtain 


r m = -p 

1 nm 2 ° 


.mk 


dgmk 
dx n ‘ 


The determinant of the metric tensor is given by the expression (6.12) as follows: 


g — gmkG 


<mk 


( 9 . 31 ) 


( 9 . 32 ) 


where G mk is a cofactor of the determinant \g m k\ which corresponds to the element g,^. 
The differential of the determinant g is, in view of (9.32), equal to 


3 g 


dg = 7 d gmk = G mk d gmk, 

ogmk 

Using now the definition of the contravariant metric tensor (6.16), that is, 

G mk rrrrVnk 

= gg > 

in (9.33), we obtain 


( 9 . 33 ) 


( 9 . 34 ) 


3g 


dg = gg dgmk => ^ = ; 

Substituting (9.35) into the result (9.31), we obtain 


r-mk 


d&mk 

dX n ‘ 


( 9 . 35 ) 
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r m _ = 9 ln Vg 

nm 2 g dx n dx' 1 

If we now recall the result (6. 17) in the form 

8 mk 8nk = <5,7- 

we may contract it to obtain 

8 mk 8mk = $'m = N (k,m = l,2,...,N). 
Thus, by differentiation with respect to the coordinate x n , we have 


9 

9x» 


(g mk 8mk) 


„mk dgmk dg' 

= g -x-jr + gmk 


mk 


dx n 


dx n 


= 0, 


or 


„mk d&mk _ 

5 0 t-i — Smk 


mk 


( 9 . 36 ) 


( 9 . 37 ) 


( 9 . 38 ) 


( 9 . 39 ) 


( 9 . 40 ) 


dx n dx' 1 

Thus, there is an alternative expression for the contracted Christoffel symbol of the second 
kind obtained by substituting (9.40) into (9.31) 


r nm o 8mk 


dg' 


mk 


dx n 


( 9 . 41 ) 


It is also of interest to calculate the quantity g kn C™ n , since it frequently appears in various 
calculations in the tensor analysis. This quantity is, by definition, given by 


g kn r™ =g* n g"“r lM =--g™ g ‘ 


~.kn „ml 


kn 


T ml ( _i_ d 8' 11 _ 


\dx n + dx k 


dx 1 


( 9 . 42 ) 


Using the symmetry of the metric tensor and interchanging the dummy indices k -<-> n, we 
see that the first two terms in the parentheses of (9.42) are equal to each other. Thus, we 
may rewrite (9.42) as follows: 


J/irm a kn a ml 

5 1 kn — s S 


( _ 1 d 8kn \ 

\ dx n 2 dx 1 )' 


( 9 . 43 ) 


The first term on the right-hand side of (9.43) can be rewritten as follows: 


kn nil dgkl 

- a^ n — 

8 ^ dx n dx n 


(g ml gu) 


kn d g‘ 

g kll gkl 6 


ml 


an dg 


_ rrkn w ( $m\ ?r 

- 8 W (Sk) - S ‘Jx^- 

Combiningthe results (9.31) and (9.36), we may write 

3 gkn 1 dg _ 3lnVg 


dx n 

ml Qginn 


dx n 


l 


kn 


2 g dx 1 2 gdx 1 dx 1 ’ 

and the second term on the right-hand side of (9.43) becomes 

1 3 V8 


( 9 . 44 ) 


( 9 . 45 ) 


1 ml„kn 3 gkn _ _ mZ ® Inyg = 

_„mn 

oO O r,/ o a/ O 


dx 1 


dx 1 


Vg dx n 


( 9 . 46 ) 
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Substituting (9.44) and (9.46) into (9.43) we obtain 

'dg mn 1 dVS\ 

dX n * yg dX n ) 
dg mn , 


pkn^m _ 

o 1 kn ~ 


Vs 


= -V= Vs-V^+s mn ~VV 


dx n 


dx n 


( 9 . 47 ) 


and using the rule for the derivative of the product VSS mn > we finally obtain the expression 
for the quantity as follows: 


g£»r 


m 

kn 


l d 

VS dX n 


( Vss mn )• 


( 9 . 48 ) 


Differential Operators 



10.1 The Hamiltonian V-Operator 

In the ordinary three-dimensional Euclidean metric space the Hamiltonian V-operator is 
defined by the following expression: 

V m = d m = 7 ^- (m= 1,2,3). (10.1) 

dx m 

However, by acting with this operator on an arbitrary vector or tensor we obtain a system 
which does not have the tensor character, as shown by, for example, (9.13). In an arbitrary 
AT- dimensional generalized curvilinear coordinate system, instead of V m , we, therefore, 
use the covariant Hamiltonian operator D m , defined by 

= (m= 1,2,..., AT). (10.2) 

Dx m 

Using the operator D m , we define the operators called gradient, divergence, curl, and 
Laplacian in the following four sections. 


10.2 Gradient of Scalars 


Gradient of a scalar function </> is a covariant vector with its covariant coordinates defined 
as follows: 


n , _ G<f> _ d<j> 
m<P Dx m dx m ' 

The contravariant coordinates of the gradient of a scalar <j> are obtained as follows: 


(10.3) 


D m <p = g mn — = g m,! — . (10.4) 

v 6 Dx n 6 dx n 

In the orthogonal curvilinear coordinates, with the metric form given by (6.60), that is, 

ds z = (h\ dx 1 ^ + ^2 dx 2 ^j + ^/i 3 dx 3 t , (10.5) 

the physical coordinates of the gradient of a scalar <p are given by 

0(0)4 = T~D m <l> = (m,M= 1,2,3). (10.6) 

«m «M dx' n 


The definition (10.6) gives the correct physical coordinates of the gradient in the well- 
known cases of cylindrical and spherical coordinate systems. As an illustration, let us 
consider the spherical coordinates where, using (6.65) and (6.66), we may write 


x = r, 
hi = 1 , 


= r, hi = rsinA 


(10.7) 
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The gradient of a scalar function </> is then, according to (10.6), given by 


J 30 _ 1 90- 1 90 _ 

grad0 = — e r + ~—e g + — — — e v , 
dr r 3 9 rsmd dip 

where e r , eg, and e,, are the unit vectors in spherical coordinates. 


( 10 . 8 ) 


10.3 Divergence of Vectors and Tensors 


Divergence of an arbitrary contravariant vector A m is a scalar obtained by the composition 
of the covariant Hamiltonian operator D m with that contravariant vector A'", that is, 


D m A 


m 


8A m 
dx ™ 


pm au 
1 nm 71 * 


Using here the result (9.36), we may write 


(10.9) 


D m A m = 


9 in y/g „ 
dx m dx 11 


(10.10) 


Since n is a dummy index in the second term on the right-hand side of (10.10), it can be 
changed to m. Thus, we may write 


m 3 A m 1 3 Jg m 

D m A = + ^-A m , 

dx m VI dx m 

VI V V 9x m 3x m 


( 10 . 11 ) 


Finally the expression for the divergence of the contravariant vector A m is given by 


D m A m = — — (JgA m ) . (10.12) 

Vg 3x m W6 ’ 

If a vector is defined by its covariant coordinates A m , then the divergence of such a vector 
is written in the form 


D m A m = g mn D n A m = D„ ( g mn A m ) , (10.13) 

where we have used the property that the metric tensor behaves as a scalar with respect to 
covariant differentiation. In such a case, using (10.12), we obtain 

D m A m = -V {~Jgg mn An) ■ (10.14) 

Analogously to the case of a contravariant vector (10.12), it is possible to define the 
divergence of an arbitrary tensor with respect to one of its upper indices. Thus, the 
divergence of the tensor T"'" is defined by 

In the orthogonal curvilinear coordinates, with the metric form given by (10.5), we have 
*Jg — h] hihi and the physical coordinates of the divergence of a contravariant vector A m 
are given by 
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D m A m = 


1 


hih 2 h 3 ^) , 

IlM ) 


li\h 2 h 3 3 x m 

where the physical coordinates of the contravariant vector A m are given by 

A m = h M A m (m,M = 1,2,3). 


(10.16) 


(10.17) 


The definition (10.16) gives the correct physical expressions for the divergence in the well- 
known cases of cylindrical and spherical coordinate systems. As an illustration, let us again 
consider the spherical coordinates defined by (10.7) above. The physical expression for the 
divergence of the vector A in the spherical coordinates is given by 


19 

divA = — 

r 2 dr 




1 


rsinS 39 


9 . 1 

(sm6>A fl ) + 


9A„ 


rsin6> 36 


(10.18) 


10.4 Curl of Vectors 


In an arbitrary IV-dimensional metric space the curl of a vector function A m is a second- 
order covariant tensor F mn , defined by the expression 

Fmn = DmAn — D^Am (rri t Yl = 1,2 ,..., AT). (10.19) 


In the special case of a three-dimensional metric space, it is possible to construct a 
contravariant vector C k related to the tensor (10.19) using the three-dimensional totally 
antisymmetric Ricci tensor s kmn , defined according to the general definition (6.23) as 
follows: 

s kmn = — e kmn ( k , m, n= 1,2, 3). (10.20) 

Vs 

Thus, the curl in the three-dimensional metric space is defined by 

C k = Lkmn Pmn = s kmn Dm A n = -L e kmn D m A n . (10.21) 

2 VS 

In the orthogonal curvilinear coordinates, with the metric form given by (10.5), we have 
Vg — h] hiV, and the physical coordinates of the curl of a covariant vector A,; are given by 

curl (fc) A = h K C k = Hk e kmn — (. h N A m ) . (10.22) 

w hih 2 h 3 3x m v 1 ’ 


where the physical coordinates of the covariant vector A n are given by 

A (n) = tt~ (n,N= 1,2,3). 
n N 


(10.23) 


As the expression (10.22) is somewhat more complex and generally not readily under- 
stood, we expand it for the three components of the curl in three dimensions 


Curl<1,/1 -S 2 /,3V 


/ d(h 3 A( 3 )) _ 3 (h 2 A ( 2 )) \ 
3 x 2 dx 3 




curl ( 2 ) A = 


1 


/ d(h\A { p) ^ 3 (h 3 A i3 )) \ 
h\h 3 \ dx 3 dx 1 J ’ 


(10.24) 
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curl, 3) A = 


1 

h\h 2 


( d(h 2 A (2) ) 

{ 


3(feiA,i)) \ 

3x 2 / ’ 


The expressions (10.24) are usually structured into a determinant defined as follows: 



h\l 

h 2 i 

h 3 3 

1 

3 

3 

d 

hih 2 h 3 

3X 1 

dX 2 

dx 3 


h\A(\) 

iizA( 2 ) 

1^3 A(3) 


(10.25) 


The definition (10.25) gives the correct physical expressions for the components of the curl 
of a vector function in the well-known cases of cylindrical and spherical coordinate sys- 
tems. As an illustration, we can write down this determinant for the spherical coordinates 
defined by (10.7). Thus, we may write 


or 


curl/1 = 


1 


r 2 sin 6 


e r reg rsm6e<p 
3 3 3 

dr 3 6 dtp 
A r rAg r sin 9A V 


cur 1/1 = 


1 


rsin0 


9 / . \ 3 Ag 

— (sinOA,) 

3 9 x dip \ 


1 

H — 
r 


_ l (M )' 

sine dtp dr v _ 


eg 


1 

+ — 
r 


9 . , , dA r 

— (rA g ) 

dr dd 


ew 


(10.26) 


(10.27) 


10.5 Laplacian of Scalars and Tensors 

The Laplacian of a scalar function in the three-dimensional vector analysis is defined by 
the expression 

Atp = div(grad^). (10.28) 

In the tensor analysis this expression becomes 

A<p = D m D m 4> = D rn ( g mn D n ct> ) = D m • (10.29) 

From (10.29) we see that the Laplacian of the scalar function <j> is a divergence of a vector 
given by its covariant coordinates 

30 

D„0 = (10.30) 

Using here the definition (10.14) of the divergence of a vector defined by its covariant 
coordinates, we may write 
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A 4>=— — (vSS mn — V (10.31) 

yg 3x m V 3x"/ 

In the orthogonal curvilinear coordinates, with the metric form given by (10.5), we have 
yg = h\h 2 h-i and the Laplacian of the scalar function </> becomes 


where 


A (p = 


1 3 

hih 2 h 3 3x m 




[g^] = 


(hi)~ 2 0 0 

0 (h 2 r 2 0 

0 0 (h 3 )~ 2 


Substituting (10.33) into (10.32) we obtain 


A 4> = 


1 

h\ h 2 h 3 


_3_ fh 2 h 3 3 (P \ 
3x l V h\ 3x 1 / 



/ h\h 3 3 (j> \ 3 / h\ h 2 

\ h 2 3x 2 ) + dX 3 V h 3 



(10.32) 


(10.33) 


(10.34) 


The definition (10.34) gives the correct physical expressions for the Laplacian in the well- 
known cases of cylindrical and spherical coordinate systems. As an illustration, in the 
spherical coordinates defined by (10.7), the expression for the Laplacian becomes 


1 3 ( ? 3 <p\ 1 3 / . 3 <p\ 1 d 2 (p 

A 0 = — — r 2 — + sin 0— + = a. 

r 2 3r \ 3 r ) r 2 sinO 3 9 \ 36 J r 2 sin 2 6 dcp 2 


(10.35) 


1 0.6 Integral Theorems for Tensor Fields 

In the three-dimensional vector analysis there are two important integral theorems, called 
the Stokes theorem and the Gauss theorem. These theorems remain valid in the tensor 
analysis and their formulation is generalized in such a way that they can be applied to 
integrals in arbitrary AT- dimensional metric spaces. These theorems in the tensor notation 
will be discussed in the rest of this section. 

10.6.1 Stokes Theorem 

In the usual three-dimensional vector notation the Stokes theorem is defined as 

j) A - dr — J curL4 • dS. (10.36) 

The integral on the right-hand side of (10.36) is a surface integral over a surface S bound 
by a closed contour C. The integral on the left-hand side of (10.36) is a line integral round 
the closed contour C running along the boundaries of the surface S. In the tensor notation 
the formulation of the Stokes theorem has the following form: 
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- f 

2 Js 


Am dx m =- F mn dS mn , 


(10.37) 


1C 2, 

where F mn is the Curl tensor of the vector A m , defined by (10.19), that is, 

Fmn — F) m A n — D n A m • (10.38) 

On the other hand, 

dS m " = dx'" dx n (10.39) 


is the contravariant tensor of an infinitesimal element of the surface S. 

From the definition (10.37) we see that the Stokes theorem in tensor notation is valid for 
arbitrary generalized metric spaces. Let us now prove that (10.37) is equivalent to (10.36) 
in the three-dimensional metric space. We first note that in the three-dimensional space 
we have 

A m dx' n =A-dx (m = 1, 2, 3). (10.40) 

From (10.40) we see that the integrals on the left-hand sides of (10.36) and (10.37) are 
indeed equivalent to each other. In order to prove that the right-hand sides of (10.36) and 
(10.37) are also equivalent to each other, it is convenient to introduce a three-dimensional 
covariant surface vector as follows: 

dS k = -e kmn dx m dx" = -v) ge kmn dx m dx n . (10.41) 

In the Descartes coordinates, where Jg = 1, the components of this vector are given by 
dSi = dx 2 dx 3 , dS2 = dx 1 dx 3 , dS3 = dx 1 dx 2 . (10.42) 


Furthermore, analogously to (10.21), it is convenient to introduce the contravariant curl 
vector, denoted by C k and related to the tensor F mn by the expression 

C k = ^e kjl Fji = e kjl DjAi = = (curl!)* . (10.43) 

Using (10.43) we may write 

curlA • dS = ^curLd) dSt = ^e kkI e km nDjAi dm dbc n . (10.44) 

On the other hand, the e- symbols satisfy the identity 

e kJl e k mn = 4nS l n ~ S n S m- U0.45) 

Substituting (10.45) into (10.44) we obtain 

curlA • dS = ^ (fin&h ~ <0 l m ) DjAidx m dx", 
curlA • dS = -( D m A n — D n A m )dx m dx", 
curlA • dS = l -F mn dS mn . 


(10.46) 
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From (10.46) we see that the integrals on the right-hand sides of (10.36) and (10.37) are 
also equivalent to each other in the three-dimensional metric space. 


10.6.2 Gauss Theorem 


In the usual three-dimensional vector notation the Gauss theorem is defined by means of 
the equation 

<pA-dS= I divide. ( 10 . 47 ) 

Js Jn 


The integral on the right-hand side of (10.47) is a volume integral over a volume Q bound 
by a closed surface S. The integral on the left-hand side of (10.47) is a surface integral over 
the closed surface S enclosing the volume £2. The volume element clQ is a relative scalar 
with the weight M = — 1. If the volume element is defined with respect to the systems 
of coordinates {x k } and {z k } by dQ, and d£2 m respectively, then according to the Jacobi 
theorem it transforms as follows: 


= 


dz m 

3 ^ 


dQ = A -1 d£2. 


( 10 . 48 ) 


Therefore, in the tensor formulation of the Gauss theorem, we use the invariant volume 
element ^/gd^, to obtain 

(f A m dS m = [ D m A m JgdQ.. ( 10 . 49 ) 

Js Js 2 

The tensor formulation of the Gauss theorem (10.48) is valid for arbitrary generalized 
metric spaces. The equivalence of (10.47) and (10.48) in the Descartes coordinates, where 
Jg — 1 and D m = V m is evident. 

The Gauss theorem can be extended to the case of an arbitrary tensor with at least one 
upper index. As an example, for a mixed third- order tensor 77””, the Gauss theorem is 
given by 

(f T[ nn dS m = [ D m T™^gdQ. 

Js Ja 


( 10 . 50 ) 



Geodesic Lines 



In Euclidean metric spaces the shortest distance between two given points is a straight 
line. However, even the simplest example of a two-dimensional Riemannian space on 
the surface of a unit sphere, shows clearly that in the Riemannian spaces there are in 
general no straight lines. It is therefore of interest to find the curves of minimum (or at 
least stationary) arc length connecting the two given points in a Riemannian space. 

In Riemannian spaces, we in general do not impose the requirement to find the lines 
of the minimum arc length, and we replace it by the requirement to find the lines with a 
stationary arc length between the two given points. We can illustrate this on the simple 
example of the two-dimensional Riemannian space on the surface of a unit sphere. For 
both arcs on the same circle, connecting the two given points on the unit sphere, we can 
only say that their length is of a stationary character, but we cannot say that they are of 
minimum length. 

The curves with stationary arc length between two given points A and B are called the 
geodesic lines and they are determined by the solutions of the corresponding geodesic 
differential equations. In order to construct these differential equations, we will use the 
variational calculus and Lagrange equations which will be derived in the next section. 

11.1 Lagrange Equations 

Let us observe two fixed points A and B in an arbitrary metric space. Their coordinates are 
given by the parameter equations 

x m = x m (S A ) x m = x m (S B ), (11.1) 

where s is the arc-length parameter. Between the fixed points A and B we can draw a family 
of curves of various arc lengths, as shown in Figure 11.1. Within the family of curves, which 
connect the two fixed points A and B as shown in Figure 11.1, there is a single curve C 
with a stationary arc length, which we call the geodesic line. All other curves between the 
two fixed points A and B, for example, C and C" , do not have the stationary arc length 
and they deviate from the geodesic lines by some variations Sx m (s ) for all values of the 
parameter 5. It should be noted that the points A and B are fixed and common for all the 
curves connecting these two points. Therefore, the variations Sx m at these two points are 
equal to zero 

Sx m (S A ) = Sx m (S B ) = o. ( 11 . 2 ) 

Let us now define a function 
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C 

FIGURE 11.1 Curves connecting the fixed points A and B. 


L = L 



dx m 

ds 



(11.3) 


along each of the lines connecting the points A and B. This function may depend on the 
coordinates x m , their first derivatives with respect to the parameter s, as well as on the 
parameter s itself, as indicated in the definition (11.3). This function is usually called the 
Lagrange function or Lagrangian. The integral of this function between the points A and B 
is given by 


7 = 




ds, 


(11.4) 


and it is usually called the action integral. According to the Elamiltonian variational 
principle, the integral (11.4) has a stationary value along the geodesic line C. Using the 
variational principle, we can derive the differential equation satisfied by the Lagrangian 
function (11.3). The variation of the action integral (11.4) along the geodesic line C is equal 
to zero and we may write 



By definition of the variation we have 


ds = 0. 


8L = Llx m + 8x n 


dx‘ 


dm 

ds +5 d7’ s 




(11.5) 


( 11 . 6 ) 


Since the variations of the coordinates and their derivatives with respect to the parameter 
are small, we can expand the first term on the right-hand side of (11.6) into a Taylor series 
and keep only the zeroth- and first-order terms. Thus, we obtain 


/ m dx m \ dL 

8L = L\ x m , — — , s I -I 8x' n 7 

V d s’ ) dx m 9 7 


-L x‘ 


dx m \ dL „ 

ds ) dx m 


dL „dx m 
ds 

dx m 
.a 

ds 


8x m ' 
ds j 

dL 

K#)' 


(11.7) 


Since the differentiation with respect to the parameter is independent from the variation, 
the order of these two operations can be interchanged. Thus, we may write 
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or 


SL = — Sx m - A 
3x m ds 


dL - v m 


3 L 


»(*&)- 

Substituting (11.9) into (11.5) we obtain 
r _ [ SB \lL 

Js a dx m 


SI : 


d 

ds 


37, 




Sx m + — 
ds 


Sx m ds + 


3 L 


'(t) 


<5x" 


(11.9) 


3 7, 

7 

l d s j 


<5x" 


Sb 


= 0. 


( 11 . 10 ) 


L V ds / J J V ds / Sa 

ich is calculated at the boundary points of the geodesic line 
inditions I 1 Thus, wo nhtain 


[ SB . 

3 L 

d 


3 L 

\ 

Js A 

dx m 

ds 

3 

( dz m \ 

l ds )] 

1 


57 = 


Since the variations Sx m are arunrary anu in genei 
requires that the following equations are satisfied: 


Sx m ds = 0. 


(11.11) 


v L \ di / j j 

arbitrary and in general different from zero, the result (11.11) 
quations are satisfied: 

= 0 (m = 1,2, ... ,7V) 


3 L d 
3x™ ~ ds 


3 L 

a ( Ax m \ 

9 l“drj J 


( 11 . 12 ) 


The system of A' -equations (11.12) is a system of differential equations that must be 
satisfied by the Lagrangian function (1 1.3) along the geodesic line C. These equations are 
usually called the Lagrange equations. 

1 1 .2 Geodesic Equations 

In order to construct the geodesic equations which define the curve with a stationary arc 
length, we may choose the arc length itself as the action integral with zero variation. Using 
the expression (6.38) we write 

cSb 


f SB I dx k dx n , 

I= L dF'ar 


From (11.13) we see that the Lagrangian function is given by the expression 


L = 


dx k dx" 


and it is equal to unity along the geodesic line C, where 

ds 2 = gt„ dx k dx n (k,n = 1,2, ... ,N). 

Along the geodesic line, where the metric condition (11.15) is fulfilled, we have 

37, 


(11.13) 


(11.14) 


(11.15) 


'(¥) 


1 / dx* dx" \ z dx 1 dx 1 

= 2 2gml ~di = 8ml dF’ 


(11.16) 
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and 


d 

ds 


dL 


a ( dx m \ 


dx' 


d 2 x l d gml dx ? dx k 


- ds [ 8ml ds ) ~ g ' nl ds 2 + ;,x k ds ds 


(11.17) 


By interchanging the dummy indices lc o- l, we may rewrite the second term on the right- 
hand side of the result (11.17) as follows: 


dgnd cLd dx* _ 1 (dgM + dx ? dx k 


dx k ds ds 2 V dx k 

Substituting (11.18) into (11.17) we obtain 


dx l ) ds ds 


d 

ds 


dL 


d* 1 ) 


d 2 X l 1 (d g m i dgmJc \ dx l dx k 

= gml — ^ + - 1 d 1 — 


ds 2 2 V dx k + dx l ) ds ds 


On the other hand we have 


dL 


dx k dx h 


dx m 2 \ 8kn ds ds 


dg [k dx l dx k 
dx m ds ds 


Using the metric condition (11.15), we obtain 


dL 1 dgi k dx l dx k 
dx™ ~ 2dx™~ds~ds~' 

Substituting (11.19) and (11.21) into the Lagrange equations (11.12), we obtain 


8ml 


d 2 X l 1 / dgml dg mk _ dgik\ dx l dx k = 


0 


ds 2 2 \ dx k dx l dx m ) ds ds 

Using now the definition of the Christoffel symbols of the first kind (9.28), that is, 


Lm,lk 


_ 1 / dgml dg km dgi k 
~ 2 V dx k dx l dx m 


(11.18) 


(11.19) 


( 11 . 20 ) 


( 11 . 21 ) 


( 11 . 22 ) 


(11.23) 


we obtain from (11.22) 

d 2 x l dx ; dx* 

gm, d? + Tm ’ lk d7^7 = (n 4) 

The composition of (11.24) with the contravariant metric tensor g mn gives the most 
commonly used formulation of the geodesic equations, as follows: 


d 2 x" „ dx 1 dx k 

“ds 2 ” + r ‘ k d7“ds“ = °' 


(11.25) 


If a set of parameter equations of some curve C in a generalized N - dimensional metric 
space is given by 


x m = x m (s) (m = 1,2, . . . , TV) 


(11.26) 
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then the tangent vector to this curve is defined by 

dx" 


u" = 


ds 


(m = 1,2,..., AT). 


Using (11.27), (11.25) can be written in the form 


du n n l k 
— + r>'u* = 0. 


Using (11.22) we can also write 


Sml 


^ + I ( a Ja* + d Jm k ) u i u t _ 


ds 2\ dx k dx 1 ) 


2 dx m 


= 0 


(11.27) 


(11.28) 


(11.29) 


Interchanging the dummy indices l k in the second term on the left-hand side of 
(11.29), we obtain 

d U l dg m i l k 1 dgi k i k _ 

8ml d s + 9x* 2dx m 

On the other hand, using the definition u m = g m iu l , we may write 

^ = ~{ gml u l ) = gml ^ + = gml ^ + d &tu l u k 

ds ds ; Sml ds ds Sml ds dx k 

Substituting (11.31) into (11.30), we obtain 

d u„ 
ds 
or 

d 2 x m 1 d g i k dx 1 dx k 


i^uV = 0, 

2 dx m 


(11.30) 


(11.31) 


(11.32) 


= 0. 


(11.33) 


ds 2 2 dx m ds ds 

Equation (11.33) is an alternative form of the geodesic equations, which does not explicitly 
involve the Christoffel symbols. Using the geodesic equations (1 1.25), we may also write 


du n „ ,dx k /9 m" „ A dx* 
-r- + rlu'-r- = — r + K-U l -r- = 


ds 


ds V dx k 


ds 


0. 


(11.34) 


or, using the definition of the covariant derivative of a contravariant vector (8.23), 


Du" 9u" „ , 

Dx* _ 9x* r '*“ ; 


(11.35) 


we find that the absolute derivative of the tangent vector (11.27), with respect to the 
parameter s is equal to zero. 


Du" 

ds 


= 0. 


(11.36) 


This means that if we translate the vector u m along the geodesic line from the point x m to 
the point x m + dx' n it will coincide with the tangent vector u m + d u m at the point x m + dx" 1 . 
This is a specific property of the tangent vector u m along the geodesic line. In the Descartes 
coordinates, where T2 = 0, the geodesic differential equation becomes 
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d V 

ds 2 


= 0 . 


The solution of this equation is a set of linear equations 

y n = a n s + b n , 


(11.37) 


(11.38) 


which represent the parameter equations of a straight line, which confirms our earlier 
statement that the geodesic lines in the Euclidean metric spaces are straight lines. 



The Curvature Tensor 


12 


12.1 Definition of the Curvature Tensor 


In the Riemannian space the parallel translation of a vector between two given points 
is a path-dependent operation, that is, it gives different results along different paths. In 
particular, if a vector is parallelly translated along a closed path back to the same point 
of origin it will not coincide with the original vector. Let us therefore derive a general 
formula for the change of a vector after a parallel translation along some infinitesimally 
small closed contour C. This change, denoted by A A m , is obtained using the result (8.19), 
in the following manner: 


A A m = j>^A m = £ r" pA n dx p . (12.1) 

Using the Stokes theorem, this line integral over the closed contour C can be transformed 
into a surface integral over a surface S bound by the closed contour C, that is, 


Ad ™ — - j> c | Dk (jmfAn) — Dp (r^jA,)] dS kp , 
where we use the covariant operator 


D k = 


D 

Dx k 


( 12 . 2 ) 


( 12 . 3 ) 


( 12 . 4 ) 


Let us also, for the sake of simplicity, introduce the non-tensor operator 

dk ~ dx k ' 

Using the definition of the covariant derivative of a second-order covariant tensor (8.33), 
we may then write 

DjcCjnp = 3 k C m p — r InfcCip — TpfcQn/ 

DpC/nk = 3 pC m k ~~ ^rnpClk ~ I k p^ml 

Subtracting the two equations (12.5) from each other we obtain 


( 12 . 5 ) 


D k c m p DpC mk — d k c m p T mk Cip r pk c m i dpC mk + r mp Cik + r kp c m i 
= 3 k Cmp ~ dpC mk: + r mo Qfc — r mk Cip + r kn c m i — r nk c m i 


( 12 . 6 ) 


Using here the symmetry of the Christoffel symbols with respect to their lower indices, we 
see that the last two terms in (12.6) cancel each other. Thus, using the symmetric surface 
tensor dS kp — dS pk , we may write 
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( DfcCmp ~ DpC m ic) dS kp — (d k C m p ~ dpC mk + — T^C/p^ dS k P. (12.7) 

By interchanging the indices k ** p in one of the last two terms in the parenthesis and 
using the symmetry of the surface tensor dS kp , we see that these two terms cancel each 
other. The result (12.7) therefore becomes 

(JA ('nip — Dp C m k) dS kp = (3 k^mp d p C mk ) d S k P (12.8) 

Applying (12.8) with C mp = T " np A n in (12.2), we obtain 

a Am = \j> c [a* (r"pA„) - dp (r n mk A n )\ dS k >\ (12.9) 

or 

AA,n = \ £ (^mpAn - dpK^n + r" np d k A n - T" fc 3 pA n ) dS k P (12.10) 

On the other hand, the change of the vector A, , along the contour C is due to the parallel 
translation (8.19), and is given by 

SA n = r IpAtdxP =>• dpAn = r l np A h (12.11) 

Substituting (12.11) into (12.10), we obtain 

A Am = (dk^pAn - d P r’; nk A n + r" p r IjAt - r n mk r l np A,) d s k ? 02.12) 

As the labels of the dummy indices are irrelevant, we can interchange the indices n o- l in 
the last two terms of the integral (12.12). Thus, we obtain 

A Am = \j> c (3*r"p - 3 p r^ + r' mp r" k - r l mlc r " p ) A n d s k >> (12.13) 

Introducing here the notation 

Krkp = 9* r "p - 3 pF n mk + r mp r ik - rl mk r ip’ d2.14) 

the result (12.13) becomes 

A A m = i £ R n mkpAn d S k P. (12.15) 


Since the closed contour C is infinitesimally small, it is possible to replace the integrand of 
the integral (12.15) by its value at some point enclosed by the contour C, and to bring 
it outside the integral. Thus, we finally obtain a general formula for the change of a 
vector after a parallel translation along some infinitesimally small closed contour C in 
the form 

A A m = \R n mk pA n AS k P . ( 12 . 16 ) 

The mixed fourth-order tensor, defined by the expression (12.14) is called the curvature 
tensor of a given metric space. The tensor character of the curvature tensor is evident from 
(12.16), since A, l is a covariant vector, A S kp is a contravariant surface tensor and A A m is 
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a difference of two covariant vectors at the same point on the contour C. The curvature 
tensor plays the key role in the theory of gravitational field and is very important in the 
tensor analysis in general. 

The formula, analogous to (12.16), for a contravariant vector A m can be obtained 
using the fact that a scalar A m B m is invariant under parallel translations. Thus, we 
may write 

A (A m B m ) = A A m B m +A m AB m = A A n B„ +A m AB m = 0. (12.17) 

Using here the result (12.16) we obtain 

A A n B n + A m l -R n mkp B n AS k P = (a A n + l -R n m kp A m AS k P ) B n = 0. (12.18) 

As the covariant vector B n is arbitrary, we obtain the formula 

AA n = -^R n mkp A m AS kp . (12.19) 

In a given Euclidean space the curvature tensor is identically equal to zero, since it is 
possible to choose the coordinates where F pip = 0 and therefore R" nkp = 0, in the entire 
metric space. Due to its tensor character, the curvature tensor is then equal to zero in 
any other coordinate system defined in the Euclidean metric space. It is related to the 
observation that, in the Euclidean metric space, the parallel translation is not a path- 
dependent operation and the parallel translation along a closed curve C does not change 
the translated vector. 

The reverse argument is valid as well. If the curvature tensor is equal to zero in the 
entire metric space, that is, R" nkp = 0, then such a space is Euclidean. Indeed, in any 
metric space, it is possible to construct a local Descartes system in an infinitesimally 
small portion of that space. On the other hand, if R n mkp = 0 in the entire space, then the 
parallel translation is a unique path-independent operation by which this infinitesimally 
small portion of a given space can be translated to any other portion of that space. 
Thus, we may construct Descartes coordinates in the entire space and the given space is 
Euclidean. 

As a summary, the criterion for determining the character of a space is that a metric 
space described by a metric tensor g mn is Euclidean if and only if R n mkp = 0. 


12.2 Properties of the Curvature Tensor 

From the definition of the curvature tensor 

Knkp = - 3 P r" fc + r l mp rf k - r l mk r" p , (12.20) 

it is easily seen that it is antisymmetric with respect to the interchange of the last two lower 
indices k -o- p, such that we have 


Rtl _ _ D/7 

11 mkp 11 mpk * 


( 12 . 21 ) 
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It is also possible to show that the curvature tensor satisfies the following identity: 

pu . p n , p n r» 

n mkp ' n pmk ' n kpm u * 

In order to prove the cyclic identity (12.22), let us rewrite (12.20) as follows: 

p Yl o pH pH p/ pH F^ 

n mkp ~ °k L rnp °P L m k ' 1 mp l Ik 1 mk l Ip 
Rn pmk = dmr Zk ~ + rl p k r im - 

m = 3„r" -9 m rf +r[ r”-r' r" 

kpm r km kp km Ip kp Im 


( 12 . 22 ) 


(12.23) 


By inspection of (12.23), using the symmetry of the Christoffel symbols with respect to the 
interchange of the two lower indices, we see that for each term in these three equations 
there is a counterterm with the opposite sign. Thus, by adding together the three equations 
(12.23) we obtain the cyclic identity (12.22). In addition to the mixed curvature tensor 
(12.20), it is sometimes useful to define the covariant curvature tensor by 

Rmnkp = SmlR n icpi (12.24) 

or using the definition (12.20) 


R 


mnkp — gml^k^ up gmfipR nk T T n pT m,lk ^ nk ^ rn,lp ■ 


(12.25) 


In order to obtain a more symmetric form of the covariant curvature tensor (12.24), we use 
(9.25) in the form 


dpgml — R l, mp T B m ,lp 
dkgml = Rl,mk + I'm, Ik 

and the definition of the Christoffel symbol of the first kind (9.28) in the form 

T m,np = ~ (ppgmn 4- dngpm ~ dmgnp) • 

Using (12.26) and (12.27) we can calculate 

gmldk^hp = gmldk (jf lS Rs,np^ = ( gmfikg B s,np + ^m^k^s,np 

— — ( g^dkgml'j Rs,np + dkl'm.np = — I'h.p^kgml + 3 k^m,np 

— ~I'np (T l,mk + I'm, Ik) + 2 ipk^pgmn + dk^ngpm ~ dk^rngnp) 

By interchanging the indices lop in (12.28) we also obtain 

gmldpR l n k = — E l nk (r; mp + r m ,Zp) 

+ 2 (dpdkgmn + dpdngkm ~ dpdmgnk) ■ 

Substituting (12.28) and (12.29) into (12.25) we obtain 

Rmnkp = — Rnp (T l,mk + T m,lk ) + nk (B l,mp + B m ,lp) 


(12.26) 


(12.27) 


(12.28) 


(12.29) 
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+ — ( d/cdpgmn + ^k^nSpm ~ ^k^mSnp) 



( 12 . 30 ) 



or 



( 12 . 31 ) 


Finally, we obtain the alternative expression for the covariant curvature tensor in the form 



( 12 . 32 ) 


From the expression (12.32) we see that the covariant curvature tensor is antisymmetric 
with respect to the interchange of both the first two indices (m o- n) and the last two 
indices {k-^-p). Thus, we have 



( 12 . 33 ) 


From the expression (12.32) we also see that the covariant curvature tensor is symmetric 
with respect to the interchange of the first two indices and the last two indices {mn ** kp), 
that is, 



( 12 . 34 ) 


Furthermore, we note that the cyclic property (12.22) also remains valid after the contrac- 
tion with the covariant metric tensor, such that we have 



( 12 . 35 ) 


Using (12.33) and (12.34), it can also be shown that the cyclic properties analogous to 
(12.35) are valid not only for the last three, but also for any three indices of the covariant 
curvature tensor. 

12.3 Commutator of Covariant Derivatives 

In the Descartes coordinates, with D k — d k , it is allowed to change the order of the 
covariant differentiation with respect to the two different coordinates. In other words, 
we may write 


3jfc3 mA p = 3 m 3 =>■ D k D m A p = D m D k A p 


( 12 . 36 ) 
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or 

{D k D m - D m D k )AP = (d k d m - d m d k )AP = 0, (12.37) 

where A p is an arbitrary contravariant vector. Since AP is an arbitrary contravariant vector, 
it can be omitted and we can write the corresponding operator expression as follows: 

D k D m — D m D k = [ D , n , i-C ] = [dm, 3,t] = 0- (12.38) 

The quantity [D m ,D k ], introduced in (12.38), is called the commutator of the two oper- 
ators D m and D k . This commutator is equal to zero in the Descartes coordinates. Let us 
now calculate the commutator [D m ,D k \ in the arbitrary curvilinear coordinates. In order 
to calculate this commutator, we first calculate 

D k D m AP = D k (d m AP + r p mn A n ) 

D k D m AP = d k (d m AP + r p nn A n ) 


+r p kl ( dm A‘ + r\ nn A n ) + r l k jdiAP + r p n A») 
DkDmAP = d k d m AP + r P ln d k A n + A n d k T^ nn 

+ r ki3 m A l + r&r l mn A" - r[ mdl AP - r[ m r p n A" 


(12.39) 


Analogously, we may write 

D m D k AP = d m d k AP + T P kn d m A n + A n d m r p kn + r P mJ d k A l 

+ r ml T krA n - rl mk d l AP ~ ^ lA"- ( 12 . 40 ) 

If we now form the difference between the expressions (12.39) and (12.40), we see 
that several terms cancel each other, that is, the first term cancels the first term, the 
second term cancels the fourth term, the fourth term cancels the second term, the 
sixth term cancels the sixth term and the seventh term cancels the seventh term. Thus, 
we obtain 

(D k D m - D m D k )A p = - d m r p n + r p ,r l mn - r p nl r l nk ) A n . (12.41) 

Substituting now the definition of the curvature tensor (12.14), that is, 

Kkm = ~ d m T P n + - T^, (12.42) 

into (12.41), we obtain 

(D k D m - D m D k )AP = R P km A n . (12.43) 

In line with the criterion for determining the character of a space, discussed above, we 
conclude from (12.43) that the change of the order of the covariant differentiation is 
allowed only in the Euclidean metric spaces with the zero curvature tensor, that is, with 
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12.4 Ricci Tensor and Scalar 


From the curvature tensor (12.14), by contraction of the single contravariant index with 
the second covariant index, it is possible to construct a covariant second-order tensor, 
called the Ricci tensor, as follows: 


Rmn — R, 


mnp 


= -R‘ 


mpn • 


(12.44) 


Using here the definition of the curvature tensor (12.14) we may write 


Rmn — ' ( hi r 


P 


n l pm ' 


dp^mn + r 


-P 


mp nl 


pP pZ 

1 mn l i p - 


(12.45) 


It is easily verified that the Ricci tensor can only be defined as in (12.44). Let us, for 
example, consider the alternative contracted tensor R k kmn , which can be written as follows: 


Rjcmn — 4 R/cmn ~ S P SplRknm ~ ^ pkmn ~ 0- (12.46) 

From the definition (12.46) we see that is a scalar product of the symmetric metric 
tensor g t>k and the antisymmetric covariant curvature tensor R p kmn< which is by definition 
equal to zero. Using the symmetry properties (12.33) and (12.34) in the definition of the 
Ricci tensor 


Rn 


— Rmnp ~ g P Rkmnp’ 


(12.47) 


and interchanging the dummy indices k ** p, we may write 

Rmn — g P Rnpkm = § P Rnkpm = S P Rknmp = Rnm ■ (12.48) 


Thus, the Ricci tensor is symmetric with respect to its two indices, that is, 


Rmn — Rnm (jn, n — 1,2,... ,N). 


(12.49) 


Using the Ricci tensor (12.44), we can define the Ricci scalar as follows: 

R = g m ' l Rmn = g mn g pk Rkmnp- (12.50) 

The Ricci tensor and Ricci scalar are extensively used in the general theory of relativity and 
cosmology. 


12.5 Curvature Tensor Components 


The components of the curvature tensor are related to each other by means of the 
symmetry relations (12.33)— (12.35). Thus, the components of the curvature tensor are not 
all independent, and the number of the independent components of the curvature tensor 
in the N - dimensional space is equal to 


n = 


N 2 (N 2 - 1 ) 

12 


(12.51) 


As this general result for the N - dimensional metric space will not be extensively 
used and its derivation involves long and complex combinatorial manipulations, 
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it will not be considered in detail here. We will only consider the two simplest 
special cases, a two-dimensional metric space (IV = 2) and a three-dimensional metric 
space (N — 3). 

In a two-dimensional metric space, the indices of the covariant curvature tensor Rkmnp> 
that is, ( k , m, n, p), can assume the values 1 and 2. Thus, the components of the covariant 
curvature tensor can be presented by the following rectangular scheme: 


[*A:m/ip] — 



np = 11 

np = 12 

np = 21 

np = 22 

km = 11 

0 

0 

0 

0 

km = 12 

0 

*1212 

-*1212 

0 

km = 21 

0 

—*1212 

*1212 

0 

km = 22 

0 

0 

0 

0 


(12.52) 


From the scheme (12.52) we see that out of possible 2 4 = 16 components, the twelve 
components with k= m or n = p or both are equal to zero due to the antisymmetry of 
the covariant curvature tensor. The remaining four non-zero components with k / m 
and n^p are related by antisymmetry relations as well. Thus, according to the expression 
(12.51), we see that in a two-dimensional metric space there is just one independent 
component of the covariant curvature tensor, that is, 


4(4-1) 

n = = 1. 

12 


(12.53) 


The Ricci scalar is then defined by 


R = g mn g Pk Rkmnp = 2g n g Z2 R 12 l2 - 2g 12 g 21 Ri 2 i2 

R = 2 (g n g 22 - g 12 g 21 ) *1212 = 2 \g mn \ Run = ~Ri 2 i 2 (12.54) 

The quantity R/2 is equal to the Gauss curvature of a surface, or the inverse of the product 
of the main radii of the curvature. 

Let us now consider a special case of the surface of the unit sphere, with the metric of 
the form 

ds 2 = d0 z + sin 2 ddip 2 , (12.55) 


or 


ds 2 = (dx 1 ) 2 + (sinx 1 ) 2 (dx 2 ) 2 . 

Thus, the covariant metric tensor can be written in the following matrix form: 

[gmrc] = 

and the contravariant metric tensor can be written in the following matrix form: 

[g mn ] = 


1 0 

0 (sinx 1 ) 2 


1 0 

0 (sinx 1 ) -2 


(12.56) 


(12.57) 


(12.58) 
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In this case, the quantity R/2 should be equal to unity. In order to show that, we need to 
calculate the only independent component of the curvature tensor, that is, 

*1212 = -^3i3ig 22 + gn(F 1 1 2 r 1 1 2 - r’,r’ 2 ) ( 12 . 59 ) 

+ g 22 (ri 2 r^ 2 - r^rf 2 ). 

On the other hand, the Christoffel symbols of the first kind are defined by 

r m,np — 2 (dpgmn + 3 ngpm ~ dmgnp)- (12.60) 

Substituting (12.56) into (12.60), we obtain 

r U i = 0 , 

Tl.12 = r 1,21 = 0, 

F i ,22 = -|3lg22 = -sinx 1 cosx 1 , 

(12.61) 

T 2.11 = 0 , 

F 2,12 = r 2 ,21 = + 2 3lg22 = + sinx 1 cosx 1 , 

T 2,22 = 0 . 

The Christoffel symbols of the second kind are defined as 

r™ = g m/ r ; ,„ p , (12.62) 

and in our example, they are given by 

Tj 1 ! =g 11 ri,n+g 12 r 2 > ii = 0, 

r i2 = r 21 = g Ur 1.21 + g 12 T 2 ,21 = 0, 

r 22 = g 11 T 1,22 + g 12 T2,22 = - sinx 1 COSX 1 , 

(12.63) 

r?! = g 21 r U i+g 22 r 2 ,ii = 0 , 

r i2 = r 21 = g 21 r U2+g 22 r 2 ,i2 = +(sinx 1 )- 1 cosx 1 , 

r| 2 = g 21 Ti,22 + g 22 r 2 ,22 = 0. 

Substituting the results (12.63) into (12.59), we obtain 

*1212 = — 2 9 l3lg22 + g22(T 2 2 ) 2 , 

*1212 = (sinx 1 ) 2 — (cosx 1 ) 2 + (sinx 1 ) 2 (sinx 1 ) -2 (cosx 1 ) 2 , (12.64) 

*1212 = (sinx 1 ) 2 . 

Finally, the Gauss curvature of the unit sphere is given by 

§ = -*1212 = (sinx 1 ) -2 (sinx 1 ) 2 = 1, 

2 g 

which proves that the Gauss curvature of the unit sphere is indeed equal to unity. 


(12.65) 
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In the case of the three-dimensional space (IV = 3), in which the curvature tensor has 
a total of 3 4 = 27 components, according to (12.51) there are six independent components 
of the covariant curvature tensor. 


9(9 - D c 

n = = 6. 

12 


( 12 . 66 ) 


There are also six independent components of the Ricci tensor R mn . Using the suitable 
Descartes coordinates in a given point, it is always possible to make three of these six 
components equal to zero. In particular, it is possible to diagonalize the Ricci tensor and 
define the curvature in any given point of a three-dimensional space by three independent 
quantities. In the four-dimensional space there are 20 independent components of the 
covariant curvature tensor. 


16 ( 16 - 1 ) 

n = — = 20. 

12 


( 12 . 67 ) 


The case of a four-dimensional space is particularly important in the general theory of 
relativity and cosmology. 


PART 3 


Special Theory of Relativity 




Relativistic Kinematics 


13 


13.1 The Principle of Relativity 

For the description of the motion of particles, it is necessary to have a system of reference. 
By a system of reference we mean a coordinate system to which we attach a clock. The 
coordinate system is used to determine the positions of particles in space and the clock is 
used to measure the times at which the positions of particles in space are measured. There 
are systems of reference where the free motion of a particle, that is, the motion of a particle 
on which there is no action of any external forces, is such that the particle has a constant 
velocity. Such systems of reference are called the inertial systems. 

If two systems of reference are translated with a constant velocity with respect to each 
other and one of them is an inertial system, then the other system of reference is also an 
inertial system. Any free motion in that system of reference will also be along a straight 
line with constant velocity. 

Using the above definitions, we can write down the statement of the special principle 
of relativity as follows: 

All laws of nature are equal in all inertial systems of reference. In other words, 
the equations which express the laws of nature are invariant with respect to the 
transformations of spatial coordinates and time from one inertial system of reference 
to another. 


13.2 Invariance of the Speed of Light 

Let us consider two bodies interacting with each other and let us assume that on one of 
the bodies some event has occurred (explosion, distance increase, or some other change). 
Then this event will be noticed at the position of the other body after the lapse of some 
time. If the distance between these two bodies is divided by this time interval, we obtain 
the maximum speed of interaction. 

In nature, the motion of bodies with a speed higher than the maximum speed of 
interaction is not allowed, since by means of such a body it would be possible to achieve 
the interaction with a speed higher than the maximum speed of interaction. Based on 
the special principle of relativity, we conclude that the maximum speed of interaction is 
invariant with respect to the transformations of spatial coordinates and time from one 
inertial system of reference to another. Thus, it is a universal constant of nature. It is shown 
that this universal constant is equal to the speed of light in the vacuum, and its numerical 
value is given by 
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c = 2,99792458- 10 8 — . (13.1) 

s 

The principle of invariance of the speed of light is therefore stated as one of the basic 
principles of relativistic mechanics. According to this principle the speed of light is 
independent on the motion of the light source, that is, on the choice of the system of 
reference in which the motion of light is described. In the next section, the mathematical 
formulation of this principle is presented. 

13.3 The Interval between Events 

An event in nature is determined by four coordinates. These include the three space 
coordinates of the position where the event has taken place and the time coordinate when 
the event has taken place. Each event is represented by a point, called the world point of 
the event, with the coordinates 

x m = |x°, x 1 , x 2 , x 3 } (m = 0, 1,2,3). (13.2) 

The first coordinate is a time coordinate defined by x° = ct and it has the dimension of 
length. The other three coordinates are the spatial Descartes coordinates 

x a = {x,y,z] (a = 1,2,3). (13.3) 

We will from now on use Latin indices for the coordinates of the four- dimensional space- 
time (13.2), and Greek indices for the usual three-dimensional spatial coordinates (13.3). 

The set of all world points constitutes a four-dimensional manifold called the world. To 
each particle in such a world, there corresponds a line called the world line. The world line 
of a particle at rest is a line parallel to the time axis. 

In order to express the principle of invariance of the speed of light mathematically, let 
us now consider two systems of reference denoted by K and K', which move with respect 
to each other along the common x-axis with some constant velocity V. These coordinate 
systems are shown in Figure 13.1. The times in coordinate systems IT and IT' are denoted by 
t and f, respectively. Consider now two events, where one event consists of a signal sent at 
time t\ , with a constant velocity equal to the speed of light, from the point with coordinates 
{X] , j/] , Z] } in the coordinate system K. The other event consists of the same signal being 
received at time t 2 and at the point with coordinates \x 2 ,y 2 , z 2 ) in the coordinate system K. 
In the coordinate system K, the coordinates of these two events are related by the following 
equation: 

c 2 (t 2 - hf - (x 2 -xi) 2 - (y 2 -yif - (z 2 -zD 2 = 0. (13.4) 

Due to the principle of invariance of the speed of light, in the coordinate system K' the 
coordinates of these two events are related by the analogous equation 

c 2 (4 - ff) 2 - (x' 2 - x' 1 ) 2 - (y 2 - y[) 2 - (4 - 4 ) 2 = 0. 


(13.5) 
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y y' 



FIGURE 13.1 The coordinate systems K and K' . 


If the coordinates (?i,xi,yi,zi) and (t 2 ,X 2 ,y 2 , z 2 ) are the coordinates of any two events, 
then the quantity 

512 = [c 2 (f 2 - hf - (x 2 - X 1) 2 - (y 2 - yi) 2 - (Z 2 - Z 1 ) 2 ] ' (13.6) 

is called the interval between these two events. Analogously to (13.6), the square of the 
interval between two infinitesimally close events is determined by the metric 

ds 2 = c 2 At 2 — dx 2 — dy 2 — dz 2 = g mn dx m dx n , (13.7) 


where (m, n = 0,1,2, 3), and the contravariant space-time coordinates in this four- 
dimensional metric space are given by 

x m = {x°, x 1 , x 2 , x 3 } = {ct,x t y,z} . (13.8) 

The components of the covariant metric tensor g mn are given by the following matrix: 


[gm;i] — 


10 0 0 
0-100 
0 0-10 
0 0 0 -1 


(13.9) 


From (13.9) we conclude that the world is a four-dimensional pseudo-Euclidean metric 
space. The determinant of the metric tensor g mn is g = -1. The matrix of cofactors of the 
matrix (13.9) is 
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[G mn ] = 


-10 0 0 
0 10 0 

0 0 10 

0 0 0 1 


Using (6.13), the inverse matrix to the matrix [g mn \ is given by 


[g”’”] = inv [gmn] = 


adj [g mn ] _ [G mn y 


(13.10) 


(13.11) 


We conclude that the components of the contravariant metric tensor g mn are the same as 
the components of the covariant metric tensor (13.9), that is, 


[g™ 1 ] = 


10 0 0 
0-100 
0 0-10 
0 0 0 -1 


From (13.9) and (13.12) we see that 

g mk 8kn = C ( k > m,n = 0,1, 2, 3). 


(13.12) 


(13.13) 


The covariant space-time coordinates in this four- dimensional metric space are given by 
x m = gmnX n = {X 0 ,Xi,X2,X 3 } = [ct, -X, -y, -z] (13.14) 

Using (13.8) and (13.14) we see that 

x m x m = g mn x m x n = c 2 f 2 - x 2 -y 2 - z 2 . (13.15) 


Using expressions (13.4) and (13.5) and the principle of invariance of the speed of light, we 
conclude that an interval which is equal to zero in one inertial system of reference is also 
equal to zero in any other inertial system of reference. On the other hand the quantities ds 
and dV are two infinitesimally small quantities of the same order. Thus, we may write 

ds = ads', (13.16) 


where the coefficient a may depend only on the absolute value of the relative velocity 

V of the two inertial systems of reference K and K'. It may not depend on the spatial 
coordinates or time due to the assumption of the homogeneity of space and time in the 
inertial systems of reference. It may not depend on the direction of the relative velocity 

V of the two inertial systems of reference K and K' either, due to the assumption of the 
isotropic nature of space and time in the inertial systems of reference. Thus, as we may 
write (13.16), we may also write 

ds' = flds. (13.17) 

From (13.16) and (13.17) we obtain a 2 = 1 or a = ±1. On the other hand, from the special 
case of the identity transformation with ds = ds' we conclude that a = + 1 , so that we always 
have 


ds' = ds. 


(13.18) 
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Thus, we obtain the mathematical formulation of the principle of invariance of the 
speed of light (13.18), which implies the invariance of the interval with respect to the 
transformations from one inertial system of reference to another. 


13.4 Lorentz Transformations 

The objective of this section is to derive the formulae for transformations of coordinates 
from one inertial system of reference to another. In other words, if we know the coordi- 
nates of a certain event x m = { cf, x! , y 1 , z! } in some inertial system of reference K' , we need 
the expressions for the coordinates of that event z m = [ct,x,y,z] in some other inertial 
system of reference K. As x m is a contravariant vector, it is transformed according to the 
transformation law 

r)Z m 

z m = x n = Ajfx". (13.19) 

dx n n 

The transformation (13.19) is a linear transformation and the coefficients A are inde- 
pendent on coordinates. The system Ajf is a mixed second-order tensor in the pseudo- 
Euclidean metric space defined by the metric (13.7), since it is defined with respect to the 
linear transformations (13.19). 

In order to calculate the components of the tensor A j” , we now introduce an imaginary 
time coordinate 

r = ict i = V— T, (13.20) 

and an imaginary metric 

da = ids i = V-l. (13.21) 

Thus, the metric of the space becomes 

da 2 = dr 2 + dx 2 + dy 2 + cfz 2 , (13.22) 

and we thereby define the four-dimensional Descartes coordinates. The linear transfor- 
mation (13.19) must keep the metric ds or da invariant. As we have argued before, the 
only transformations of Descartes coordinates that keep the metric form invariant are the 
transformations of translation, rotation and inversion. 

Parallel translation of four-dimensional Descartes coordinates is not a suitable can- 
didate for the transformation (13.19), since it merely changes the origin of the spatial 
coordinates and the origin for measurement of time. Similarly, the inversion of the 
four-dimensional Descartes coordinates is not suitable either, since it merely changes the 
sign of the spatial coordinates and time. 

Thus, we conclude that the only suitable candidate for the transformation (13.19) is a 
rotation of four-dimensional Descartes coordinates. We are looking for the formulae for 
transformation from the coordinate system K' to the coordinate system K as shown in 
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Figure 13.1. In that case we have y = y' and z — z’ and we are only interested for the rotation 
in the rx plane, as shown in Figure 13.2. From Figure 13.2, we immediately obtain the 
remaining two transformation formulae 


x = x! cos x/s — t' sin i/r, 
r = x? sin f + r 1 cos x/r. 


(13.23) 


According to Figure 13.1, the coordinate system K' is moving with respect to the coordinate 
system K with a constant velocity V. If we then consider the motion of the origin of the 
coordinate system K' we then have x! = 0. Then from (13.23) we obtain 


x .V 
tan xjf = — = i — . 


Using now the trigonometric formulae 

1 


COS x/r 


' C 


sini/f = 


tani/f 


v 7 1 + tan 2 i fr v 7 1 + tan 2 V'" 


we obtain 


cos x]/ = 


/ 


i-5 


Substituting (13.26) into (13.23) we obtain 


sim/f = 


a 


/ 


i-5 


(13.24) 


(13.25) 


(13.26) 


r' + i(V/c)x' 



t' + (V/c 2 )x' 
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x = 


x' — i(V /c) x' 



x! + Vf 



y = y' y = V 

z = zl z = z' 


(13.27) 


The results (13.27) are the well-known formulae for transformations of coordinates from 
one inertial system of reference to another. They are called the Lorentz transformations. 
In the special case when V <£. c, the Lorentz transformations are reduced into the so called 
Galilei transformations of non-relativistic mechanics 


t = t', 

x = xf + Vt, 

(13.28) 

y=y. 

z = zl . 

Although the Galilei transformations are closer to our everyday experience than 
the Lorentz transformations, they are not in accordance with the principle of rel- 
ativity and they do not leave the metric form of the four-dimensional space-time 
invariant. 

Using (13.27), it is easy to construct the tensor of Lorentz transformations Af in the 
following matrix form: 


1 

V/c 

0 

0 ' 







Vic 

1 

0 

0 


\/ 1_ 7r 


(13.29) 

0 

0 

1 

0 


0 

0 

0 

1 



The inverse of the tensor A denoted by (A'f) 1 , is given in the following matrix form: 



(13.30) 


and it is obtained from the tensor (13.29) by reversing the sign of the relative velocity V, 
that is, by putting V—? — V. Thus, we may write 
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9r m 

— = (AjfrHV) = A«(-V). 


It is easily shown, by direct multiplication of the matrices (13.29) and (13.30), that 


\ m / \ k\ — 1 _ s m 
A k ( A n> = ■ 


(13.31) 


Q3.32) 


13.5 Velocity and Acceleration Vectors 

In the special theory of relativity, the time differential At is not a scalar invariant, and the 
usual definition of the three-dimensional velocity 

dx" 

i? = -r- (a = 1,2,3), (13.33) 

at 

is less useful, since it does not behave as a vector with respect to the transformations from 
one inertial system of reference to another. Therefore, we introduce a four- velocity, as a 
contravariant vector 

T)x' n dx m 

u m = — — = — - (m = 0, 1,2,3). (13.34) 

ds ds 

On the other hand, by definition, we have 

ds 2 = g mn dx m dx n = c 2 dt 2 + g a p dx" dx 

= c 2 df 2 — dx 2 — dy 2 — dz 2 , (13.35) 


and the square of the intensity of the three-dimensional velocity is given by 

2 dx 2 + dy 2 + dz 2 B 

v 2 = k = -gcpifvP. 


From (13.35) with (13.36), we obtain 


d2 


v 2 \ 


gaff^VP 


ds = c 2 df 2 ^1 - j = c 2 dt 2 I 1 + 

Substituting (13.37) into (13.34), we obtain 

dx m 

u m = == (m = 0, 1,2,3). 

c d^l-^ 

The temporal zeroth component of the four- velocity is then given by 

n 1 


and the three spatial components of the four- velocity are given by 

a ^ 

U = 




(13.36) 


(13.37) 


(13.38) 


(13.39) 


(13.40) 
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From (13.39) and (13.40) we conclude that the components of four-velocity are not 
independent on each other, but satisfy the equality 


u m u m = g mn u m u n 


gmn d-d” dx" d ^ 2 _ 1 


(13.41) 


since we have 

ds 2 = g nm cbc m dx". 


(13.42) 


The four- acceleration of a particle is defined as a contravariant vector 

D u m d u m 


w"‘ = 


ds ds 


(m = 0, 1, 2, 3) 


Using here (13.38), we may write 


w"‘ = 


1 d if 


i/2 dt 


or 


1 dx" 




i/ 2 dt 


The temporal zeroth component of the four- acceleration is then given by 

n 1 d 1 


and the three spatial components of the four- acceleration are given by 

1 d 


c 2 Jl- 4 dt 


(13.43) 


(13.44) 


(13.45) 


(13.46) 


(13.47) 


From (13.41), we conclude that the vector of four- acceleration is always orthogonal to the 
vector of four-velocity. This can be shown by differentiating (13.41) with respect to the 
metric, that is, 


d / m du m „ du n 

^ ( gmn M M ) = gmn ~r~ U n + gmn U m ~t~ = 0, 

d u m „ d u n m n du m 

gmn -X— U n + g nm —— U m = 2g mn - r - U n = 0, 


ds ° ds 
g mn U) m U n = w n u n = 0. 


ds 


(13.48) 


Thus, the vectors w n and u" are indeed orthogonal to each other. 



Relativistic Phenomena 



14.1 Introduction 

The principle of relativity gives rise to a major change in our understanding of space and 
time. This change is reflected in a number of relativistic phenomena that contradict our 
everyday experience. The main idea behind all of these phenomena is the claim that the 
observers, in relative motion with respect to each other, have different perceptions of 
space and time. The objective of this chapter is to provide a brief discussion of some of 
the most important relativistic phenomena. 

14.2 Time Dilatation 

According to the principle of relativity, two identical clocks attached to two different 
observers, in relative motion with respect to each other, tick at different rates and 
consequently provide different results for the time interval between two given events. The 
relativistic time dilatation is the observation that the measured time interval between two 
events appears to have a minimum value in the inertial system in which the measuring 
clock is at rest. That minimum time interval in the rest frame of the clock is usually referred 
to as proper time interval. 

In order to describe the phenomenon of time dilatation mathematically, we consider 
two systems of reference denoted by K and K' , which move with respect to each other 
along the common x-axis with some constant velocity V, as depicted in Figure 13.1. 

The times in the two coordinate systems K and K' are denoted by t and f, respectively, 
and we assume that the origins of the coordinate systems K and K' coincide at a 
synchronized time instant t = ? = 0. Let us now attach a stationary clock to the origin of 
the coordinate system K' , moving with constant velocity V with respect to the origin of the 
coordinate system K, along the common x-axis. In the coordinate system K' the stationary 
clock ticks with the proper time period At? = t? 2 — t\ . On the other hand, when observed 
from the coordinate system K, the same clock ticks with some other period At = t 2 — t\. 

Next we choose the first tick of the moving clock to occur at the time instant t — t? = 0 
when the origins of the two coordinate systems K and K' coincide, whereby we define the 
first event (A) with coordinates 

x% = { 0 , 0 , 0 , 0 ) , x^ = { 0 , 0 , 0 , 0 ) , ( 14 . 1 ) 

in reference frames K and K' , respectively. As the second event (B), we choose the next tick 
of the clock with coordinates 


Tensors, Relativity, and Cosmology. http://dx.doi.org/10.1016/B978-0-12-803397-5.00014-5 
Copyright © 2015 Elsevier Inc. All rights reserved. 


105 


106 TENSORS, RELATIVITY, AND COSMOLOGY 


= {cAt, V At, 0,0], x A m = {cAf, 0,0,0} , (14.2) 

in reference frames K and K', respectively. In the first of equations (14.2) we note 
that in the coordinate system K, after the time period At the moving clock has the x- 
coordinate V At. By definition (13.6) the metric intervals between the events A and B are 
given by 

sab = [c 2 At 2 - V 2 At 2 j , s^g = c At', (14.3) 

in reference frames K and K', respectively. By the principle of relativity (13.18) we know 
that the metric interval is invariant such that 


sab = c 


i V 2 

A h/l- TT = cAZ " = S AB’ 


(14.4) 


or 


At = 


At' 



> At', 


0 < V < c. 


(14.5) 


From the result (14.18) we readily see that the time interval At' in the rest frame K ' is 
smaller than the corresponding time interval measured in any other reference frame K 
moving with relative speed V (0 < V < c) with respect to K'. Thereby we confirm that the 
measured time interval between two given events indeed appears to have a minimum 
value in the rest frame of the measuring clock. 

For example, the result (14.18) implies that an Earth-bound observer (reference frame 
K ) would measure a considerably longer duration At of a trip taken by a very fast space 
traveler (V~ c) as compared to the duration of the same trip At' measured by the clock 
attached to the space traveler herself (reference frame K') . 


14.3 Length Contraction 

Another interesting consequence of the principle of relativity is that the length of an object 
does not have an absolute meaning. The length of an object depends on its motion with 
respect to the coordinate system in which it is measured. The relativistic length contraction 
is the observation that the length of a moving object, measured along the direction of 
motion, appears to have a maximum value in the inertial system in which the object is 
at rest. Such a maximum length in the rest frame of the object is usually called the proper 
length of the object (Zo). 

In order to describe the phenomenon of length contraction mathematically, we again 
consider the two systems of reference denoted by K and K ', as depicted in Figure 13.1. 
The times in the two coordinate systems K and K' are denoted by t and t', respectively, 
and we once more assume that the origins of the coordinate systems K and K' coincide 
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at a synchronized time instant t — f = 0. Let us now place a thin object (a rod) of proper 
length Zo at rest along the x-axis of the coordinate system K, such that the first end of the 
rod (A) is situated at the origin of K and the second end of the rod (B) is situated at the 
spatial position x — Ax = Zo in K. 

As the first event (A) we now choose the passing of the origin of K' by the first end of the 
rod (A). This occurs when the two origins of K and K' coincide, such that the coordinates 
of that event are given by 

x™ = {0,0,0,0} , x A m = {0,0,0,0), (14.6) 

in reference frames K and K' , respectively. As the second event (B), we choose the passing 
of the origin of K' by the second end of the rod (B). This occurs when the origin of K' has 
traveled the length of the rod Z (Z / Zo) that we expect to measure in K' . The time At 1 , that 
it takes to travel the length Z between the two ends of the rod with a constant velocity V, as 
measured in K' , is given by 

At' = ^ (Ax' = 0), (14.7) 

where we note that in K' both events occur at the origin and the distance along the x-axis 
between the events A and B is equal to zero. The time At, that it takes to travel the proper 
length Ax = Zo between the two ends of the rod with a constant velocity V, as observed 
from K, is given by 


At - — - — 
V ~ V' 


The coordinates of the second event (B) in reference frames K and K' , respectively, are 
then 




(14.12) 



108 TENSORS, RELATIVITY, AND COSMOLOGY 


or 


Z=Z o yi- — < Z 0 , 0 < V < c. (14.13) 

From the result (14.13) we readily see that the proper length of the object lo in the rest 
frame K is larger than the length l measured in any other coordinate system K' moving 
moving with relative speed V (0 < V <c ) with respect to K. Thereby we confirm that the 
length of a moving object, measured along the direction of motion, indeed appears to have 
a maximum value in the rest frame of the object. In other words, the result (14.13) implies 
that a very fast spaceship ( V ~ c) appears for an Earth-bound observer to be considerably 
shorter (i.e., contracted) along the direction of motion. 


14.4 Relativistic Addition of Velocities 


Consider again the two systems of reference K and K' depicted in Figure 13.1. Now we 
assume that a point particle is moving with some arbitrary velocity i/ x (t) along die x'-axis 
of the coordinate system K'. The coordinate system K ' itself is moving along the x-axis of 
the coordinate system K with some constant velocity V. In classical mechanics one would 
expect that the two velocities V and i/ x add to a resulting velocity v x observed from the 
reference frame K, such that 

v x = V + i/ x . (14.14) 


It is obvious that this result cannot be valid in the relativistic mechanics, since a very fast 
object (i/ x ~ c) moving in the very fast coordinate system K' ( V ~ c) would appear to move 
faster than light in the reference frame K . In order to find the correct formula for relativistic 
addition of velocities, we use the Lorentz transformations (13.27), where we note that V is 
constant, whereby 


d t = 


At' + {V /c 2 )Ax' 



Then, by definition of velocity, we obtain 


dx = 


Ax' + V At' 



(14.15) 


or finally 


dx dx' + V At' (Ax' /At') + V 

At At' + (V / c 2 )Ax' 1 + (V /c 2 )(Ax' /At') ' 


K + v 

1 + (V /c 2 )v' x ’ 


(14.16) 


(14.17) 


which is the required relativistic formula for addition of velocities. For V < < c, it is reduced 
to the approximate classical formula (14.14), since in such a case (V ■ t^/c 2 ) can be 
neglected compared to unity in the denominator of the formula (14.17). From (14.17) we 
also note that the largest possible speed of the point particle remains the speed of light, in 
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any inertial frame of reference. Even when both i/ x = c and V = c, the resulting velocity is 
v x = c and does not exceed the speed of light. 


14.5 The Twin Paradox 


The phenomenon of time dilatation gave rise to an apparent paradox, known as the 
twin paradox. Suppose that two twins Mary and Kate are separated in such a way that 
Mary travels with a very fast spaceship (V = 0.98 c) to a nearby star and returns after 
AT = 10 years according to a clock attached to herself and thereby to her ship (reference 
frame K'). On the other hand, Kate remains on Earth (reference frame K) and according to 
(14.18) measures the time of Mary’s trip to be 


At = 



10 years 
0.2 


= 50 years. 


(14.18) 


If the two twins were 30 years old at the time of separation, upon her return Mary would 
be 40 years old whereas Kate would be double that age, that is 80 years old. Even though 
this outcome is very counter-intuitive and psychologically shocking for a layman, there is 
really no paradox here at all. It is just an example of the effect of time dilatation. In other 
words, it is simply the way nature really works according to the principle of relativity. 

On the other hand, according to the principle of relativity, there is nothing special about 
the system of reference K where Kate spent the time period At, compared to the system 
of reference K' where Mary spent the time interval AT. Any inertial frame of reference 
can be chosen as “stationary” such that every other inertial frame of reference is “moving” 
with respect to it. Thus, at a first glance, it seems to be possible to claim the opposite to 
the above description of the Mary’s trip. One could argue that it was not Mary traveling 
with velocity V = 0.98 c with respect to “stationary” Kate, but it was instead Kate that was 
traveling with velocity V = —0.98 c with respect to “stationary” Mary. In such a case Kate 
should be younger than Mary after the trip instead, since the sign of the relative velocity is 
irrelevant in the analysis of the time dilatation phenomenon. 

This situation gave rise to an apparent dilemma as to which of the two twins is older 
after the trip. In the case of an even faster spaceship, the question might read, which of the 
two twins is alive after the trip, given the normal age of the human beings. This apparent 
dilemma was considered as a paradox by some early critics of the theory of relativity. The 
detailed quantitative analysis of the above example of Mary and Kate, based on a specific 
choice of trajectory of Mary’s spaceship, is not included here. 

However, such a detailed quantitative analysis is really not needed to settle the issue, 
at least in principle. It is sufficient to realize that it is not possible for Mary to travel to a 
nearby star and return to Earth while staying in an inertial system of reference throughout 
her entire journey. 

The reciprocity between the two frames of reference K and K\ used in the above 
description of the twin paradox, is only valid if both reference frames remain inertial 
during the entire time between the two observed events (departure and return of Mary’s 
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spaceship). This, in turn, requires that the two reference frames K and K' are translated 
with constant velocity V relative to each other throughout the entire journey. This require- 
ment obviously cannot be satisfied by Mary if she is ever to return to Earth. Mary needs 
to decelerate her ship, stop and then accelerate again in the opposite direction along the 
same straight line. Alternatively, Mary could travel along a curved trajectory (e.g., a circle) . 
In both cases, the reference frame K' does not remain inertial during the entire trip. 

Thus, there is no reciprocity between the two frames of reference K and K ' , and a 
proper analysis of the problem at hand requires the application of the principles of the 
general theory of relativity covered in the Chapter 18 of the present book. In conclusion, 
as Kate remained in an inertial reference frame K between the two events (departure and 
return of the Mary’s spaceship), she has indeed aged more than Mary in agreement with 
the time dilatation effect described earlier. 



Relativistic Dynamics 



15.1 Lagrange Eguations 


Let us consider a free particle, that is, a particle that is not under the influence of 
any forces. The equations describing the motion of this particle are obtained using the 
variational principle. The action integral is defined by ( 1 1 .4) , that is, 


7 = 



(15.1) 


where u n is the four-velocity of the particle. The equations of motion of the particle are 
obtained using the variational principle, that is, the condition that the variation of the 
action integral is equal to zero, SI — 0. The Lagrange equations of motion of the particle 
are given by (11.12) in the form 


dL 
dx " 



= 0 (n = 0, 1,2,3). 


(15.2) 


Let us now define the form of the Lagrangian function for a free particle. Equations (15.2), 
derived using the variational principle applied to the action integral (15.1), are equal in 
all inertial systems of reference and the action integral is a scalar invariant with respect to 
the Lorentz transformations (13.29). Thus, the Lagrange function L itself is also a scalar 
invariant. Due to the homogeneity of the four-dimensional space-time in the inertial 
frames of reference, the Lagrange function cannot be a function of space-time coordinates 
x n . Thus, we may write 


L = L(u n ) (n = 0, 1,2,3). 


(15.3) 


The only invariant which can be created using the four- velocity vector u n is given by 

u n u n = gk n u k u n = 1 (k,n = 0, 1,2,3). (15.4) 


Thus, the action integral of a free particle is simply proportional to the arc length in the 
four-dimensional space-time manifold, that is, 


f SB , r SB / dx k dx n , 


L *- mc L 

(15.5) 


where m is a mass parameter of the free particle. From (15.5) we see that the Lagrangian 
function is given by the expression 


L = 


-mcjg kn u k u n . 


(15.6) 
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, 1 / 2 ' 


Using the Lagrangian (15.6) we may write 

3 L 3 f Ik A 1 

= -me [gnu u) 

du n 3 u n L V 6 7 / 

= ~ 2 mC ( 8k l uk “0 7 (&» ^ + Snj M 7 ') ■ 

Using here gkjU k id — 1 and the symmetry of the metric tensor g, y - = g jn we obtain 


Thus, we may write 


dL l b. b 

— = — -me 2 u = —mcgknU = -mcu n . 

OU n 2 


d 3L d(«„ 

- = me — — . 

ds 3 w" ds 


Using the Lagrangian (15.6) we may also write 

dL 


dx n 


= 0. 


Substituting (15.9) and (15.10) into the Lagrange equations (15.2), we obtain 


d u n d 

m c — — = — (me u n ) = 0. 
d.s' d5 


(15.7) 


(15.8) 


(15.9) 


(15.10) 


(15.11) 


(15.12) 


The equations of motion of a free particle are then given in the following form: 

n da" 

u> n = = 0 (a = 0, 1,2,3). 
ds 

From (15.12) we see that a free particle moves along a straight line with constant velocity. 

Let us now investigate the non-relativistic limit of the action integral (15.5) when the 
particles are moving with low velocities (a<$/c). If we substitute (13.37) into (15.5) we 
obtain 


I = — me ‘ 


r { B / 7,2 rt B 

) tA ]/ 1 -f2 dt = J rA L T {x?,if)dt, 


(15.13) 


where Lj is a Lagrangian defined with respect to the non-scalar time variable t, given by 

Lj (jc“, if 1 ) = —me 2 Jl — ^ = —me 2 J 1 + (15.14) 


In the non-relativistic limit of particles moving with low velocities [v <3C c ) , we may use the 
approximation 


1 - « 1 


v 1 

TaP' 


(15.15) 


Substituting (15.15) into (15.13) we obtain 
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Thus, the non-relativistic approximation of the Lagrangian Lj can be written as follows: 

Lj {x 01 ,v a ) ^mv 2 — me 2 . (15.17) 

As the non-relativistic Lagrangian is defined up to an arbitrary additive constant, the 
second term in (15.17) does not contribute to the non-relativistic equations of motion and 
can be dropped in the non-relativistic applications. Its physical significance in the special 
theory of relativity will be discussed later in this chapter. Thus, the action integral (15.5) 
has the proper non-relativistic limit. 


15.2 Energy-Momentum Vector 

15.2.1 Introduction and Definitions 


In the non-relativistic mechanics there is a number of constants of motion. These include 
the energy and the momentum of the particle. The objective of this section is to define the 
energy and the momentum of a particle in the framework of the special theory of relativity. 
Let us start with the action integral (15.13), in the form 

I = f B L t (x“, if) df. (15.18) 

Jt A 

The Lagrange equations of motion of a particle , analogous to (11.12), defined in terms of 
the Lagrangian Lt with the non-scalar time variable t as the parameter, are given by 


dLj 

~dxf 


d / dLj \ 
df \3f“ / 


= 0 (a = 1,2,3). 


(15.19) 


Let us now calculate the total time derivative of the Lagrangian Lt as follows: 

d L r dL T dx“ 3 L r dif 

—L = — — + — — . (15.20) 

df 3 x“ df dv° df 

Using here the equations of motion (15.19) and the definition of the three-dimensional 
velocity (13.33), we obtain 


d Lt _ d td Lt\„ dLj dif _ d / dLj 
df df\3f“/ dv 01 df df\3c 0 ' / 


(15.21) 


The result (15.21) can be rewritten in the form 


d 

df 


\ 3f“ 


= 0 . 


(15.22) 


Thus, the quantity 


£ = 


if — Lt = Constant 

dif 


(15.23) 


is a constant of motion. The quantity (15.23) can be recognized from the non-relativistic 
mechanics as the total energy of the particle. Let us also define the momentum p a , 
conjugate to the coordinate x", as follows: 
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3 Lt a a a dL T 3 L t 

p "Hf^ p --S 

where 

{P 01 } = {Px,Py,Pz}, {p a \ = {-Px,-Py,-Pz} 
Substituting (15.24) into (15.23) we obtain 

£ = -p a if -L t = -gap jfyP -L t = pv-L T . 
Using the debnition (15.14) of the Lagrangian Lt, we can calculate 


3 Lj o 3 

p a = = mr 

dv“ 3 v 01 


1 + 


gvpi 


p vP\ 1/2 
c 2 ) ’ 


or 


, 1 / g v pv v vP\ l ^ 2 1 3 . ». 

= me 2 2 (l + — ^ 

Using the symmetry of the metric tensor g v p — gp v , we obtain 

g v pv v v fl \~ 1/2 


p a = m- 14- 


2gapV P . 


Using v a = g a p i/, we finally obtain 


Pa = 


m v a 


i 1 


=4 p = 


m if 


(15.24) 


(15.25) 


(15.26) 


(15.27) 


(15.28) 


(15.29) 


(15.30) 


Substituting (15.30) and (15.14) into (15.26), we obtain the total energy of the particle as 
follows: 


£=- 


mValf 


i 1 


+ me . 1 T = 


A 3 ? 


4 -m r, 1 — 7r 


ic 2 m (c 2 — v 2 ) 


AAs A-' 


+ m c z ,/ 1 T 


m c 


— m c 


Thus, the total energy of the particle is given by 


1 t + m c 2 , 1 77 = 


A 3 ? 


£ = 


mcr 


y 1 


(15.31) 


(15.32) 


The particle at rest with v = 0 has the so-called rest energy To given by 

To = wc 2 . 


(15.33) 
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The kinetic energy of the particle is obtained if the rest energy (15.33) is subtracted from 
the total energy (15.32). Thus, we obtain 


£k = £ — So = me 1 




- 1 


For the particles moving with low velocities (v <3C c) , we may write 




1 li 2 

^ ^ 1+ 2c2' 


Substituting (15.35) into (15.34) we obtain 


£k = -fray 2 ■ 


(15.34) 


(15.35) 


(15.36) 


In the same approximation (v <& c ), the components of the momentum of the particle are 
given by 

p a = mif. (15.37) 


The approximate results (15.36) and (15.37) are the well-known non-relativistic expres- 
sions for the kinetic energy and the momentum of a moving particle, respectively. 


15.2.2 Transformations of Energy-Momentum 

The results for the momentum of the free particle (15.30) and the energy of the free particle 
(15.32) were derived from the Lagrangian Lj, defined with respect to the non-scalar time 
variable t, in a non-covariant way. Now we want to define the energy and momentum as 
the constants of motion using the covariant Lagrangian L given by (15.6). In analogy with 
the definitions (15.24), we now define the energy-momentum four- vector p n with following 
covariant and contravariant components: 


p n = = mcu n =>■ p n = = meu" (15.38) 

r du n H du n 

From (15.11) we see that the four components of the energy- momentum of a free particle 
satisfy the equations 


d p n 
ds 


= 0 (ra = 0, 1,2,3). 


(15.39) 


Thus, for a free particle the four components of the energy-momentum four-vector 
are conserved. Using here (13.39), we obtain the temporal zeroth component of the 
energy-momentum vector, which is proportional to the energy £ of the particle, in 
the form 


me £ 



p° = m c u° 


c 


(15.40) 
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where the energy of the particle is given by (15.32). Furthermore, using (13.40), we obtain 
the three spatial components of the energy- momentum four-vector, which are equal to 
the components of the three-dimensional momentum of the particle p a given by (15.30), 
that is, 


p a = me u a 


m ii 01 



(15.41) 


From the results ( 15 . 40 ) and ( 15 . 41 ), we see that the energy and momentum of a particle 
are not two independent quantities as in the non-relativistic mechanics. In the relativistic 
mechanics they are components of the same four-vector. It should be noted that the 
constant of motion analogous to the non-relativistic result ( 15 . 23 ), is identically equal 
to zero 

f) T 

u n — L = — p n u n — L = 0. (15.42) 

du n p 

The energy-momentum vector as a four- vector, transforms with respect to the transforma- 
tions from one inertial system of reference to another, according to the transformation law 

p k = A k n p m , (15.43) 

where p k are the components of the energy-momentum tensor in the inertial system of 
reference K, while p rk are the components of the energy-momentum tensor in the inertial 
system of reference K' moving along the common x-axis with a velocity V with respect to 
K. Using here the explicit form of the tensor A k given by ( 13 . 29 ), we obtain 

£ = ; 1 (£' + Vpi) . 

<>=■«> 

V 1 IT 
Py = p'y 
Pz = Pa- 
using ( 15 . 38 ) with ( 13 . 41 ) we obtain 

p n p n = m 2 c 2 u n u n = m 2 c 2 . (15.45) 


From (15.45) we may write 


£ 2 

+ p a p a = m 2 c?. 


(15.46) 


From the results (15.25) for the components of the three-dimensional momentum vector, 
we may write 


p a p a = g a pp a p l) = -pp = -p 2 . 


(15.47) 
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Substituting (15.47) into (15.46), we obtain 


<-■ 2 2 2 
— — p = me. 


C‘ 


(15.48) 


The relation between the energy and the three-dimensional momentum of a particle then 
becomes 



(15.49) 


The function (15.49) is usually called the Hamiltonian of the particle and it is denoted by 
H. Thus, we may write 



(15.50) 


being the most usual definition of the Hamiltonian of a particle. 

15.2.3 Conservation of Energy-Momentum 

The conservation law of the momentum of a particle is a consequence of the homogeneity 
of space and the conservation law of the energy of a particle is a consequence of the 
homogeneity of time. Due to the homogeneity of space-time, the mechanical properties 
of a free particle remain unchanged after the translation of a particle from a point with 
coordinates x n to a point with coordinates x n + k n , where X n is an infinitesimally small 
constant four-vector. Thus, the Lagrange function must be invariant with respect to this 
translation and its variation must be zero. Therefore, we may write 


Using here the equations of motion (15.2) we obtain 



(15.51) 



(15.52) 


Since X n is a non-zero infinitesimal four-vector, the expression (15.52) gives 


d f dL 1 


(15.53) 


ds d u k 


Using now (15.38) we obtain 


d dL du n dp n 

= m c — — = — — 

ds 3 u n ds ds 


dpn 

ds 


(15.54) 


The expression (15.54) then gives 


dp n 1 dp' 1 


= 0 =>■ p n = Constant. 


ds 


v 2 d t 


(15.55) 
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Thus, we obtain the three-dimensional momentum conservation law 

d p 

— = 0 =>■ p = Constant, 
df K 

and the energy conservation law 


— = 0 =>■ £ = Constant, 
df 


(15.56) 


(15.57) 


1 5.3 Angular Momentum Tensor 

In the relativistic mechanics the angular momentum tensor is defined by the expression 

M „k = XnPk - x k p„ (k,n = 0, 1,2,3). (15.58) 


Only the spatial components of the angular momentum tensor with ( k , n— 1,2, 3) have a 
physical meaning and coincide with the usual definition of the angular momentum in the 
non-relativistic mechanics. In the non-relativistic mechanics it is customary to form an 
axial three-dimensional angular momentum vector 

M v = l -e mli M a p = e vafl X a pf, (a, £ = 1,2, 3) (15.59) 


or in the classical vector notation 


M = 


12 3 

X\ X2 X3 
Pl P2 P3 


= r x p. 


(15.60) 


The conservation law of the angular momentum tensor (15.58) is a consequence of 
the isotropic nature of the four-dimensional space-time. The three-dimensional angular 
momentum conservation law is a consequence of the isotropic nature of the three- 
dimensional space. Due to the isotropic nature of the four-dimensional space-time, 
the mechanical properties of a free particle remain unchanged after rotations in the 
four-dimensional space-time. Let us consider the special case of a rotation for some 
angle 9 about the 3-axis in the three-dimensional space. Then the relation between the 
coordinates z m in the rotated inertial system of reference K' and the coordinates x k in the 
original inertial system of reference K is given by 


z n = Qjx-i (j,n = 0, 1,2,3). 

Using (5.40) we may rewrite (15.61) in the matrix form 


*91 


' 1 0 00' 


fx°l 

z 1 


0 cos 9 sin0 0 


X 1 

z 2 


0 —sin6 cos 6 0 


X 2 

z 3 


_ 0 0 0 1 _ 


W 


(15.61) 


(15.62) 


If we now consider the rotation for some infinitesimal angle 89 ~ 0, then we have 


cos89 m 1, sintSO^Se. 


(15.63) 
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Substituting (15.63) into (15.62) we obtain 


~ Z° ~ 


' 1 0 00' 


r*°i 

z 1 


o 

O 


X 1 

z 2 


0-86 10 


X 2 

_z 3 _ 


0 0 0 1 


CO 

k 


( 15 . 64 ) 


or 


Z n = X n + 8njx ] ' = X n + 8Q nk g kJ xi = x n + 8Q. nk x k , 

where 8QJ is a mixed tensor defined by the following antisymmetric matrix: 



0 0 0 0 

0 0 89 0 

0 -89 0 0 

0 0 0 0 


( 15 . 65 ) 


( 15 . 66 ) 


Using here (13.9) and (13.12), we obtain the covariant coordinates x k of the four- 
dimensional space-time in matrix form 


[**] = [gjk] [ xJ ] 


o 

o 

o 


r*°l 


- x° - 

0-10 0 


X 1 


-X 1 

0 0-10 


X 2 


—X 2 

O 

O 

0 

1 

i—* 


_ X 3 _ 


1 

X 

CO 


( 15 . 67 ) 


We can also calculate the components of the antisymmetric contravariant tensor 8Q nk in 
matrix form as follows: 


= [y fc ] [sqj 1 ] , 


( 15 . 68 ) 


or 


" l 

0 

0 

0 


" 0 

0 

0 

0 ' 

0 

-l 

0 

0 


0 

0 

86 

0 

0 

0 

-1 

0 


0 

-86 

0 

0 

0 

0 

0 

-1 


0 

0 

0 

0 


( 15 . 69 ) 


Thus, we finally obtain the matrix form of the antisymmetric contravariant tensor 8£2 nk as 
follows: 



0 0 0 0 

0 0 -89 0 

0 89 0 0 

0 0 0 0 


( 15 . 70 ) 


For an arbitrary infinitesimal rotation, the contravariant tensor 8£l nk has a more complex 
form compared to the simple matrix (15.70), but it is always an antisymmetric tensor. 
Thus, we always have 


8Q. nk = -8Q kn (k,n = 0 , 1 , 2 , 3 ), ( 15 . 71 ) 

and the most general infinitesimal rotation of the coordinates is given by the following 
transformation relations: 
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z n =x n + 8x n , 8x n = SO nk x k . 


(15.72) 


The most general infinitesimal rotation of four-velocity vector is given by the following 
transformation relations: 


u' n = u n + 8u n , 8u n = SO nk u k . 


(15.73) 


The Lagrange function of a particle L(x'\ u n ) must be invariant with respect to this 
infinitesimal rotation, that is, we must have SL — 0. Let us now calculate the variation of 
the Lagrangian L with respect to the infinitesimal rotations (15.72) and (15.73), that is, 


or 


3 L „ 3 L „ 

SL = 8x n + 8u n = 0, 

dx n du' 1 


SL = —SO nk x k + —SO nk u k = 0. 
dx n k d u n k 


Substituting the equations of motion (15.2) into (15.75) we obtain 


8L = 80. 


nk 


' d 

< 3 L '' 

3 L 

ds 

\aic i 

) Xk+ ^ Uk _ 


= 0 . 


(15.74) 


(15.75) 


(15.76) 


Using the definition of the energy- momentum four- vector (15.38) and the definition of the 
four- velocity (13.34), we obtain 


SL = -SO 


nk 


d Pn , d x k 


= (80 nk p n x k ^ = 0 . 


(15.77) 


Using here the antisymmetry of the tensor <5£2 , (15.77) becomes 

SL = (PnX k - p k x n ) = ^ SO nk = 0. (15.78) 

From (15.78), as a direct consequence of the isotropic nature of the space-time, we obtain 
the conservation law of the angular momentum tensor, that is, 


dM„i- dM„i- „ 

= 0 => — = o => M nk = Constant 


(15.79) 


ds dr 

The spatial part of this conservation law for ( n , k= 1, 2, 3), that is, 

M" = e va ^x a pp = Constant (15.80) 

is the usual conservation law of the three-dimensional angular momentum vector. 


Electromagnetic Fields 



16.1 Electromagnetic Field Tensor 

The electromagnetic field in the four-dimensional space-time is described by a four- vector 

A n =A n (x k ), (16.1) 


which is called the potential of the electromagnetic field. The temporal component of this 
four- vector is defined by 


A o = Vd* fc ) 
c 


(16.2) 


where f (x k ) is the electric scalar potential. The three spatial components of this four- 
vector 


A a = A a (x k ), 


(16.3) 


constitute the so called magnetic vector potential. Thus, we may write 

A n = | “Vo J 

An = gnkA = | — V r > ~Ax, —Ay, — A z J . (16.4) 

The action for a particle with mass m and charge q moving in the electromagnetic field 
defined by A n (x k ) is then given by 

I = I s + Iq, (16.5) 


where I $ is the action for a free particle given by 

Sb 

: A 

and Iq is the action describing the interaction of the charged particle with the electro- 
magnetic field defined by the four- vector A n {x k ). The simplest invariant action that can 
be constructed using four- vectors A n (x k ) and u n , describing the electromagnetic field and 
the motion of the particle respectively, is given by 


g kn u k u n ds, 


(16.6) 



pSs f S B 

Iq = —q / gknA k u- n d s= -q ) A n 
J Sa J Sa 


u n ds. 


Substituting (16.6) and (16.7) into (16.5) we obtain 

cSb 


I=- 


g kn u k u n + qA n u n j ds. 


(16.7) 


(16.8) 
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The Lagrangian of the charged particle moving in the electromagnetic field defined by 
A n (x k ) is then given by 


L = —me 


\Jgk n u k u n - qA n u n . 


(16.9) 


Using (16.9) we obtain 


and 


d 

ds 


3L 

9x^ 



3A n 

= -<7 — r u ’ 
' dx k 


(mcu k + qA k ). 


(16.10) 


(16.11) 


Substituting (16.10) and (16.11) into the Lagrange equations (15.2) gives 

d , . 3 A n 

— (mcu k + qA k ) = 


or 


d u k ( 3 A n 3 A k \ 

me—— = q — ; ur. 

ds ' V dx k dx n ) 


(16.12) 


(16.13) 


In (16.13) we define the covariant electromagnetic field tensor F kn as follows: 

Fkn = ^ ^ = dkAn ~ dnAk ' (16 - 14) 
By definition (16.14), the covariant electromagnetic field tensor F kn is an antisymmetric 
tensor and it satisfies 


Fkn = -Fnk {k,n = 0, 1,2,3). (16.15) 

Substituting (16.14) into (16.13) we obtain the equations of motion of a charged particle in 
the electromagnetic field 

me = qF kn u n - (16.16) 

ds 

Using the definition of four-momentum p k = me u k in (16.16) we may write 

^ = qF kn u n . (16.17) 

Let us now demonstrate that the three spatial equations (16.17) in the three- 
dimensional vector notation are equal to the well-known expression for the Lorentz force. 
Equation (16.17) for the three spatial components (a = 1, 2, 3) has the form 

= qF an u n = qF a0 u° + qF^u?. (16.18) 

Using ( 13 . 37 ), ( 13 . 39 ), and ( 13 . 40 ) here, we obtain 

= q c F a0 + q FapvP . 


(16.19) 
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We then define the three-dimensional electric field vector denoted by E a and the three- 
dimensional magnetic induction vector denoted by B v , as follows: 


FaO = — , F a0 = — 
c c 

F a p = -e a p v B v , F «P = 


(16.20) 


where 


{ E a } = {E x ,E y ,E z } , {E a } = { -E x ,-E y ,-E z ) 

[B ? } = {B x , B y , B z } , {B a ) = { -B x , -By, -B z } 

Substituting (16.20) into (16.19) we obtain 

^ = qE a - qe a p v v ft B v . 

Using here p a = {-p x ,-Py,-Pz)> d* = {v x , v y , v z } and the expressions (16.21) for the com- 
ponents of the vectors E and B, (16.22) gives 


(16.21) 


(16.22) 


-j^- — ~qE x — q ( VyB z — v z By ) 

d 

= ~rjFy ~ q ( v z B x — v x B z ) 
^ I ^ = —qE z — q (v x By — VyBfi 


(16.23) 


Thus, we obtain the familiar expression for the Lorentz force in the three-dimensional 
vector notation, as follows: 


dw~_~ 
-f- = qE + q v x B. 
at 


(16.24) 


[Fkn] = 


Using the definitions (16.20), the covariant electromagnetic held tensor F^ n can be 
written in the following matrix form: 

0 E x /c Ey/c E z /c " 

~F X / c 0 —B z By 
By/ c B z 0 —B x 
—E z /c -By B x 0 

The mixed electromagnetic held tensor is given by 

Ffi = g k] F jn (j, k, n = 0, 1, 2, 3). 
and it can be written in the following matrix form: 


[Fn\ = [g fcj ] [Fjn] = 


0 E x /c E y /c E z /c 
E x /c 0 B z -By 
Ey/ c —B z 0 B x 
_ E z / c By —B x 0 


(16.25) 


(16.26) 


(16.27) 
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The contravariant electormagnetic field tensor F kn is given by 


nkn 


= g kJ Fjig ln = F k g ln (j,k,l,n = 0, 1, 2, 3) 


and it can be written in the following matrix : 


[F kn \ = [Ff\ [g" ! ] = 


0 -E x /c -Ey/c - E z /c 
E x / c 0 —B z By 

Ey/ c B z 0 —B x 

E z / c —By B x 0 


(16.28) 


(16.29) 


As the next step, let us now calculate the electric field vector E in terms of the potentials of 
the electromagnetic field (16.1). By definition (16.14) of the tensor F \ n , we have 


. 3 A 0 3 A a 

-C(y C Jr /y 0 C I r\ 

' dx* dx° 


Using now i jr = cAq and x° — ct we obtain 


E a = 


3 x/r 3 Act 


dx a 


dt ’ 


or 


™ 3 A a . . a 3 A a 

E S?‘lF“ (grad * r) “IT- 


(16.30) 


(16.31) 


(16.32) 


The contravariant components of the four-dimensional gradient of a scalar function i/r are 
given by 


g kn 7rx; = I -^.-grad^^.-grad f,-grad.^ 


df 


dx 11 


1 df 


C dt 


(16.33) 


Using (16.33) for (a = 1,2,3), we obtain the relation between the electric field vector 
E and the potentials of the electromagnetic field (16.1), in the three-dimensional vector 
notation 


dA 

E= -gradiA. 

dt 


(16.34) 


Let us now calculate the magnetic induction vector B in terms of the potentials of the 
electromagnetic field (16.1). By definition (16.14) of the tensor F^, we have 


F a p = d a Ap — dpA a = — 8p8^ d a A z = 


£vap@ XOT d a A r — e a p v e vaT d a A T — e a p v B x 

where we have used the identity (10.45) in the form 

„ ^vcrr o<7 or c-cr or 
evape —°a°p—°p°ct’ 


(16.35) 


(16.36) 


From (16.35) we obtain 


B v = -e var d a A z . 


(16.37) 
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Using (16.37) and the definition of the curl of the vector A, we may write 


r d(-A z ) 

3(— Ay)l 

1 

1 ^ 

3 Ay 

= curl x A 

l 3 y 

dz 

3y 

dz 

r 3 (~ a x ) 

3(-A z )l 

dA x 

1 05 

> 

N 

= curl,,A 

dz 

dx 

dz 

dx 

[3 (“Ay) 

3(-Ac)1 

1 

!■> 

3 A x 

= cuii z 4 

dx 

3 y \ 

dx 

9y 


Equations (16.38) written in the three-dimensional vector notation give 

B = curlA. (16.39) 


The result (16.39) is the familiar relation between the magnetic induction vector B and the 
magnetic vector potential of the electromagnetic field (16.3). 

The temporal zeroth component of (16.16) gives 

me 2 = qcFo a u a , (16.40) 

ds 

or 


d 

df 




qcFoaif. 


From (16.20) we see that cFocif 1 = E ■ v. Thus, we finally obtain 


(16.41) 


<1£k 

dt 


= qE -v. 


(16.42) 


Equation (16.42) is the statement that the change of the kinetic energy £k of a charged 
particle in the electromagnetic field is equal to the work done by the electric field E. The 
work done by the magnetic field is identically equal to zero, since the force of the magnetic 
field qv x Bis always perpendicular to the direction of the velocity v. 


1 6.2 Gauge Invariance 

From the result (16.24) we see that, by measurements of the forces acting on a charged 
particle in the electromagnetic field, we can measure the components of the three- 
dimensional electric field vector E and the three-dimensional magnetic induction vec- 
tor B. Thus, the components of the electromagnetic field tensor are observable physical 
quantities which are uniquely defined. On the other hand, the components of the poten- 
tial of the electromagnetic field (16.1) are not uniquely defined. From the definition of the 
electromagnetic field tensor (16.14) we see that its components are invariant with respect 
to the gauge transformations of the potential of the electromagnetic field 

d<j> 
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where <j> is an arbitrary scalar function. Substituting (16.43) into (16.14), we obtain 


- _ dA , i 3 Ak _ 3 A n _ 3 Ak _ 

“ TT ” 3^ “ "TT 3x" ~~ kn ~ 


( 16 . 44 ) 


Thus, we need to impose an additional condition on the components of the potential of 
the electromagnetic field in order to make it more precisely defined. Such condition is 
usually called the gauge condition or simply the gauge of the theory. In the relativistic 
electrodynamics the most commonly used gauge is the Lorentz gauge which requires that 
the components of the potential of the electromagnetic field satisfy the equation 


dA n 
3 x n 


3 A n 
3x" 


3 A 0 
3x0 


3 A a 
3x“ 


T~n + 7 = divA + — — = 0. 

c 2 dt 


dijr 


( 16 . 45 ) 


When the Lorentz gauge (16.45) is adopted, then using (16.43) we obtain 

3 An dAn ^ 3 2 (f) 

dXfi 9x^2 dXndX n 


( 16 . 46 ) 


As both potentials A(x k ) and A(x k ) satisfy the Lorentz condition (16.45), the arbitrary 
function <p(x k ) must satisfy the wave equation of the form 


3 2 <t> 

dx n dx n 


y 1 3 2 <t> 

= V 2 (j)- ——A = 0 . 
C 2 dt 2 


( 16 . 47 ) 


Thus, <p(x k ) is no longer an arbitrary scalar function but a solution to the wave equation 
(16.47). It should be noted here that, even when we impose the Lorentz condition (16.45), 
the components of the potential of the electromagnetic held are still not uniquely defined 
and only the class of the allowed gauge transformations (16.43) is significantly reduced. 


16.3 Lorentz Transformations and Invariants 


The potential of the electromagnetic held A k , as a four-vector, transforms with respect 
to the transformations from one inertial system of reference to another according to the 
transformation law 


A k = A k A m (k,n = 0 , 1 , 2 , 3 ), 


( 16 . 48 ) 


where A k are the components of the energy- momentum tensor in the inertial system of 
reference K, while A'" are the components of the energy-momentum tensor in the inertial 
system of reference K' moving along the common x-axis with a velocity V with respect 
to K. Using here the explicit form of the tensor A k given by (13.29), we obtain 



( 16 . 49 ) 
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The transversal components of the vector A remain unchanged and only the longitudinal 
component is impacted by the transformation from the inertial system of reference K to 
the inertial system of reference K' , moving along the common x-axis with a velocity V with 
respect to K. 

The electromagnetic held tensor F^ transforms from one inertial system of reference 
to another, according to the law 


Fjl = a|a n iF’ kn (j, 7c, 7, n = 0, 1, 2, 3), 


( 16 . 50 ) 


where Fji are the components of the energy-momentum tensor in the inertial system 
of reference K while F', are the components of the energy- momentum tensor in the 
inertial system of reference K' moving along the common x-axis with a velocity V 
with respect to K. Using here the explicit form of the tensor given by (13.29), we 
obtain 



( 16 . 51 ) 


Introducing here the notation 



( 16 . 52 ) 


we may write 



( 16 . 53 ) 


Substituting (16.53) into (16.51), we obtain 



F02 = yFq 2 + ^yF[ 2 
F03 = yFq 3 + j yF [ 3 
F23 = F' 2 3 , 


f 3 1 = yFsi + ty f 30’ 
Fl2 = yF[ 2 + tYK 2 - 


( 16 . 54 ) 
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Using now the explicit form of the tensor Fj cn given by (16.25), we have 


E X = E' X 

E y = y{e' v - VB' z ) , 
E z = y (e' z + VB'y) , 
B x = B' x 

By = Y ( B'y + %E' Z ) , 
B z = y (B> z -%E'). 


(16.55) 


Thus, we finally obtain the transformation laws for the three-dimensional electric field 
vector E and the three-dimensional magnetic induction vector B in the form 


— E' 

1 

1 

E z + V B' 
F.„- y 




By — B' 

o B ’y + 

R, — 

a 

1 

ta 

1 

ToN 

u x u x , 




(16.56) 


(16.57) 


The longitudinal components of the vectors E and B remain unchanged and only the 
transversal components are impacted by the transformation from the inertial system of 
reference K to the inertial system of reference K', moving along the common v-axis with a 
velocity V with respect to K. 

In order to find the invariants of the electromagnetic field, following the discussion 
leading to (7.5), we form the secular equation for the electromagnetic field tensor in the 
form 


det(T^ - X8*) = 0, 


(16.58) 


where X is by definition a scalar invariant. Using here (16.27) and introducing for simplicity 
a vector e = E/c, we obtain 


e x 


-X Bz — By 


Gy B z X 


Br 


By —B x —X 


= 0. 


(16.59) 


Expanding the expression (16.59) and adding together the terms of the same order in the 
scalar parameter X, we obtain 

— X |^— X ^7. 2 + B 2 ^ — B z ( XB Z — B x By ) — By (B z B x + X By ) J 

- e x [e x (x 2 + £ 2 ) - B z (- Xe y - B x e z ) - B y (~e y B x + Xe z ) j 

+ Gy [fix (XB Z — B X By ) + X ( — XGy ~ B X G Z ) — By ( Gy By + C Z B Z )] 

- Cz \g x (B z B x + XBy ) + X (— G y B x + Xe z ) + B z ( G y B y + e z B z )] = 0 


(16.60) 
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or 


X 4 + X 2 B 2 + X 2 B 2 — XB x B v B z + l 2 B 2 + XB x B y B ; 


A 2 e 2 ■ 




Xe x eyB z ■ 


y | ,^ x ±JyU Z 

&xB x e z B z ~ exBxCyBy + ke x Bye z 

4- ke x e y B z — e x B x e y By — k 2 e^ — kB x e y e z — &yB 2 — eyBye z B z 
- e x B x e z B z - Xe x B y e z + XB x e y e z - X 2 e 2 z - e y B y e z B z - e 2 B 2 = 0. 


(16.61) 


After regrouping and cancellation of all terms equal in magnitude but with opposite signs, 
we obtain 


X 4 + X 2 (b 2 + B 2 + B 2 — — e 2 — e 2 ) - (4 B l + 4 B y 

i t' 2 B 2 + 2 e x B x e y B y + 2 e x B x e z B z + 2e y B y e z B z ^ 


y 

= 0. 


In the three-dimensional vector notation (16.62) becomes 

. 2 


X 4 + X 2 (b 2 - e 2 ) - (e • =0. 


Using here e = E/c, we obtain 


X 4 + X 2 l B 2 r — 


EB 


= 0 . 


(16.62) 


(16.63) 


(16.64) 


Equation (16.64) can also be written in terms of the electromagnetic field tensor F kn as 
follows: 


{l/knF kn ) ~ Q ^ kln F jk F ln j = 0. (16.65) 

Since X is an invariant absolute scalar, the quantity 

1 E 2 

-FknF kn = B 2 = Invariant (16.66) 

2 c 2 

is an absolute invariant of the electromagnetic field. The quantity 

-e* u FjkFin = = relative invariant (16.67) 

is a relative invariant, since B = curlA is a relative axial vector. However, from ( 16 . 65 ) we see 
that the square of the quantity ( 16 . 67 ) is also an absolute invariant of the electromagnetic 
field. These are the only two scalar invariants that can be constructed using the electro- 
magnetic field tensor F kn . 



17 

Electromagnetic Field Equations 


17.1 Electromagnetic Current Vector 

Let us consider a system consisting of a number of charged particles moving in an 
electromagnetic field specified by the electromagnetic potential four- vector 

A n =A n (x k ) (lc,n = 0, 1,2,3). (17.1) 


the electromagnetic field is a sum of terms (16.7), given by 



(17.2) 


where M is just a label for the Mth particle and not a tensor index. An explicit summation 
sign is therefore required in (17.2) . In the electromagnetic held theory it is usually assumed 
that there is a continuous distribution of charges in the three-dimensional space, with the 
charge density defined by 


d q 
dV’ 


(17.3) 


where the differential dV is an infinitesimal volume element of the three-dimensional 
space. In such a case (17.2) can be written as follows: 


= - f adV [ 

Jv JSa 


Sb dx" dx° 
An ' 

df c 


(17.4) 


Let us now introduce the four-dimensional volume element, defined by the expression 

dfl = dx° dV = dx° dx 1 dx 2 dx 3 . (17.5) 


Since the four- dimensional space in the special theory of relativity is a pseudo-Euclidean 
space, the four-dimensional volume element (17.5) is a scalar invariant. Substituting (17.5) 
into (17.4), we obtain 

1 7 dx" 

Iq = a— A n dfl. (17.6) 

c dt 

Since the action integral (17.6) and the four- dimensional volume element (17.5) are both 
scalar invariants and A n is a covariant four- vector, the system defined by 

dx n 

J n = o- T-> (17.7) 

dt 

is a contravariant four-vector called the electromagnetic current vector. The temporal 
component of the four-vector (17.7) is proportional to the charge density a and is given 
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by 


/° = ca. 


(17.8) 


The spatial components of the four-vector (17.7) constitute a three-dimensional current 
density vector. It is the flux of the charge q through the element of the surface Id that 
surrounds the volume domain V of the three-dimensional space in which the charges are 
distributed. 


/“ = a if = 


d 3 q 

dndr' 


(17.9) 


The components of the electromagnetic current vector can therefore be structured in the 
form 


7 " = {CO,J X JyJ Z } , 


Jn — EnkJk — { Ca , — Jx, ~Jyi ~Jz\ ■ (17.10) 

Substituting (17.7) into (17.6) we obtain 

Iq = —— [ J' 1 A n dQ. = - [ C Q dQ, C Q = -7" A«, (17.11) 

c Jn c Jn 

where Cq is the interaction Lagrangian density. Due to the electric-charge conservation 
law, the charge density o and the current density J satisfy the continuity equation. 
According to the electric- charge conservation law, the negative increment of the electric 
charge q within a three-dimensional volume V is equal to the total flux of electric charges 
through the boundary surface n of the volume V in the unit of time. Thus, in the three- 
dimensional vector notation, we may write 

ih - < i7i2 > 

Using here the three-dimensional Gauss theorem and regrouping, we obtain 




(17.13) 


From (17.14), we obtain the differential continuity equation in the form 

da 

— + div/ = 0. (17.14) 

at 

In the tensor notation, the result (17.14) becomes 


97 “ 

— = 0 (72 = 0,1,2,3). (17.15) 

dx n 

The continuity equation (17.15) is therefore related to the conservation law of the total 
charge in the entire system, defined by 


q = f adV=-f 7 ° dV, 
Jv c Jv 


(17.16) 
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where V is the entire available volume of the system. Since there is no flux of electric 
charges through the boundary surface n of the volume V of the entire system, the surface 
integral in (17.12) vanishes and we see that the time derivative of the quantity (17.16) is 
equal to zero. 


17.2 Maxwell Equations 

The objective of the present section is to derive the differential equations satisfied by the 
electromagnetic field tensor and its components. From the definition of the electromag- 
netic field tensor (16.14), that is, 


_ dA n 
- dx k 


dA k 

dx' 1 


— d k A n — d n A k , 


we see that it satisfies the cyclic equation 


( 17 . 17 ) 


djF kn + dnFjk + dkF n j = djd k A n — djd n A k 
+d n djA k - d„d k Aj + d k d n Aj - d k djA n = 0 


( 17 . 18 ) 


Thus, the first differential equation satisfied by the electromagnetic field tensor is 

dF kn djjk d_Fnl = Q 

dxi dx' 1 dx k 


( 17 . 19 ) 


When all three indices j, k and n are equal to each other ( j = k = n ), (17.19) is a trivial 
identity since F kn = 0 for k — n. When two of the indices j, k and n are equal to each 
other (/ = k or j — n or k—n), (17.19) is also a trivial identity due to the antisymmetry of 
the electromagnetic field tensor F kn = —F nk . The only equations of interest are the four 
equations obtained for j / k / n. Thus, we may write 


dFoi dFzo 9Fi2 
dx 2 dx 1 dx° 


dFoi 9F30 dF 13 _ 

dX 3 dx 1 dx° 


( 17 . 20 ) 


dFo2 dF30 dFz3 _ 
dx 3 dx 2 dx° 

dFu dF3i dF23 _ _ 
dX 3 dx 2 dx 1 

Using here the components of the covariant electromagnetic field tensor (16.25), we 
obtain 

dE x dEy 9 B z ^ 

3 y dx 3 1 


dE x dE z d By q 

3 z dx 3 1 


( 17 . 21 ) 
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dEy 3 E z 3 B x p 

3 z 3 y 3 1 

3 B z 3 By 3R X 

— + — + — = 0 . 

3z 3y 3x 

In the three-dimensional vector notation the result (17.21) gives the first pair of Maxwell 
equations, that is, 

-► dB -* 

curlE -1 =0, divf? = 0. (17.22) 

3 1 

Thus, (17.19) is the four-dimensional form of the first pair of Maxwell equations. 

In order to derive the second pair of Maxwell equations we need to define the action 
of the electromagnetic field. The total action for a system consisting of a continuous 
distribution of charged particles in the electromagnetic field is given by 

I = I s + Iq + I P , (17.23) 


where I s is the action for the free particle distribution that does not include the elec- 
tromagnetic fields or potentials. Its explicit form is therefore not needed in the present 
section. The term Iq is the action describing the interaction of the charge distribution 
with the electromagnetic field and it is given by (17.11). The term Ip is the action of the 
free electromagnetic field which is an integral of an invariant scalar function called the 
Lagrangian density Cf over the entire three-dimensional volume V in the time interval 
[tA, f/i]. Thus, the action of the electromagnetic field is of the form 

If = f [ B Cf (A n , 3 k A n ) dV d t=- f C F (A n , 3 k A n ) da. (17.24) 

Jv Jt A c Jo. 

The invariant Lagrangian density function Cf cannot depend on the potentials A n as 
they are not uniquely defined. It may only depend on the space-time derivatives of the 
potentials BtA n , or in other words on the electromagnetic field tensor F^ n . Thus, we may 
write 

If = - f C F (3 k A n ) da = - f C F (F kn ) da. (17.25) 

c Jn c Ja 


Furthermore, due to the linearity of the electromagnetic field equations, the invariant 
Lagrangian density function Cf must be at most a quadratic function of the electromag- 
netic field tensor. The only field invariant that satisfies this condition is given by (16.66). 
The invariant Lagrangian density Cf is therefore given by 


Cf (Fkn) 


-JLpknp. 

4/xo 2 2 , 10 ' 


(17.26) 


The invariant (16.66) can be multiplied by an arbitrary constant and we have chosen 
this constant to be equal to (-2/ro) -1 in order to secure the correct physical dimensions. 
In (17.26) the quantities /to and <?o= (/roc 2 ) -1 are the vacuum magnetic permeabil- 
ity and the vacuum electric permittivity, respectively. Substituting (17.26) into (17.25) 
we obtain 
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dl^ df. 


(17.27) 


Substituting (17.11) and (17.27) into (17.23) we obtain the total action of the system in the 
form 



(17.28) 


During the derivation of the equations of motion of a charged particle in the electro- 
magnetic field (16.16), we have assumed that the electromagnetic field is defined by four 
given functions forming the four- vector A n = A n (x k ). We have therefore only varied the 
Lagrangian (16.9) with respect to the quantities describing the motion of the particle, that 
is, the space-time coordinates x k and coordinates of the four-velocity u k . On the other 
hand, in the present derivation of the electromagnetic field equations, we assume that 
the motion of the charged particles or a continuous distribution of charges in space is 
defined by given space-time coordinates x k and coordinates of the four-velocity u k . We 
will therefore calculate the variations of the action (17.28) with respect to the quantities 
describing the electromagnetic field, that is, the electromagnetic potential four- vector A n 
and the electromagnetic field tensor F kn . Thus, the variation of the first term in (17.28) 
is zero and this term does not contribute to the derivation of the electromagnetic field 
equations. It can therefore be omitted from the action of the system. Furthermore, the 
electromagnetic current four- vector J n , as a function of a given charge distribution and its 
motion in space, is also not varied. The variation of the action (17.28) is then given by 



(17.29) 


with 


£ (An, d k A n ) = -J n A„ - -L F kn F kn . 

4 Mo 


(17.30) 


From (17.29) we may write 



(17.31) 


Since we have S (d k A n ) = <) k (M„), we may write 




(17.32) 


Applying the Gauss theorem to the second integral on the right-hand side of (17.32), it 
is reduced to the integral over the hypersurface which encloses the given domain £2 of 
the four-dimensional space-time. On the other hand, the variation of the electromagnetic 


136 TENSORS, RELATIVITY, AND COSMOLOGY 


field variables is assumed to be zero on the boundary of the domain O, and this integral 
vanishes. Equation (17.32) then becomes 


,1=1 [ 1-^-3* 
c Ja 1 3 d„ 


3£ 


SA n dtl = 0. 


(17.33) 


L3 (d k A n )_ 

Since the variation SA n is arbitrary, from (17.33) we obtain the electromagnetic field 
equations in the following form: 


k 


d£ 


3 ( 3 k A n) 


= 0. 


Using now the expression (17.30) we obtain 


_ _jn 

3 A n 


and 


d£ 


d£ dFji dC 


3 (3 k A n ) 3 Fji 3 (3 k A„) dF fl 3 (3 k A„) 


(' dj'Ai - 3 iAj) 


d£ 

dF;/ 


(w - s f‘") 


d£ d£ d£ 
= 2 - 


9 Fkn dFnk 3 F kn 


= 2 - 




dF/cn \ 4 fi 0 


2 no 3 F]a 


(- F Jl Fjl ) 


(17.34) 


(17.35) 


—FJ‘ ( - Sh") = — (F kn - F nk ) 

2no V ] 1 1 1 > 2 MO V ) 


pkn pnk\ 1 pkn 

MO 


Substituting (17.35) and (17.36) into the field equations (17.34), we obtain 


-r - 3 1 ( ~—F kn ) = 0 , 

MO 


or after regrouping 


3 F kn 
dx k 


= HoJ n (k,n = 0, 1,2,3). 


The temporal component of (17.38) gives 


or 


3 F 0a 1 dE a 1 , - n 

= = -divf? = no J = no ca , 

dx° c dxA c * 


divT = no c 2 a = — . 


60 


The three spatial equations (17.38) give 

3 F 0a 3 pP 01 
dx° dxP 


= mo r, 


(17.36) 


(17.37) 


(17.38) 


(17.39) 


(17.40) 


(17.41) 
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or 

C 2 dt dxP C 2 dt dxP M oJ ■ 

In the three-dimensional vector notation (17.42) is given by 

.5 1 

curlB - — — = //o/. 

C z dt 

Equations (17.40) and (17.43) are the second pair of Maxwell equations. Thus, the com- 
plete system of electromagnetic field equations in the four-dimensional notation is 
given by 

3 F kn 


( 17 . 42 ) 


( 17 . 43 ) 


dF kn dFj k 3 F nj = p 

3 xi dx n dx k 


dx k 


= Mo /", 


with the continuity equation 


dj n 

~dxP 


= 0. 


( 17 . 44 ) 


( 17 . 45 ) 


The four-dimensional formulation of Maxwell equations (17.44) with (17.45) is quite 
compact and it nicely emphasizes the relativistic nature of these equations. 


17.3 Electromagnetic Potentials 


The objective of this section is to formulate the differential equations for the electro- 
magnetic potentials A n =A n (x k ) and to outline their solutions. In order to derive the 
differential equations for the electromagnetic potentials, we substitute the definition of 
the electromagnetic field tensor ( 16 . 14 ) into (17.38). Thus, we obtain 


3 fc (d k A n - 3 n A k \ = 3*3*71" - d n d k A k = no J n . 


( 17 . 46 ) 


Using here the Lorentz gauge (16.45), the second term on the left-hand side vanishes, and 
we obtain the differential equation for the potentials of the electromagnetic field in the 
form 

3 2 A n 


dx k dxi 


= MO/"- 


By expanding (17.47) we obtain 


1 3 2 A n 
C 2 dt 2 


V 2 A" = mo/"- 


or 


V 2 A n - 


1 3 2 A' 1 


C 2 dt 2 

The solution of this equation has the form 


= -mo/". 


A n (x k ) = ^ 

4 n Jv 


i 


J n (x° - R.x 01 ) 
R 


dV, 


( 17 . 47 ) 


( 17 . 48 ) 


( 17 . 49 ) 


( 17 . 50 ) 
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where the integral is over the domain V where the sources of the electromagnetic field 
are distributed. The quantity!? in (17.50) is the three-dimensional distance between the 
position of the sources and the position where the potential is calculated, that is, 

R = J\x? -x a l 2 . ( 17 . 51 ) 

Thus, for a given electromagnetic current four-vector J n (x k ), we can calculate the electro- 
magnetic potential four- vector A” (x fc ) using the result (17.50). The electromagnetic field 
tensor is then obtain using the definition (16.14). 


17.4 Energy-Momentum Tensor 

We have shown earlier that the momentum conservation law is a consequence of the 
homogeneity of space and the energy conservation law is a consequence of the ho- 
mogeneity of time. In the four-vector language, due to the homogeneity of space-time, 
the physical properties of a free field remain unchanged with respect to the space-time 
translations. Thus, the energy-momentum tensor of the electromagnetic field is a field 
invariant defined with respect to the space-time translations. The Lagrangian of the free 
electromagnetic field is given by (17.26), that is, 

£ (Fkn) = £f ( Fkn ) = — F kn F kn . ( 17 . 52 ) 

4/ro 

Furthermore, it is assumed that there are no sources of the electromagnetic field, that is, 
that the electromagnetic current four- vector is equal to zero, J n = 0. Thus, the interaction 
Lagrangian Cq, defined by (17.1 1), vanishes. The second pair of Maxwell equations (17.38) 
then gives 

d k F kn = 0 . ( 17 . 53 ) 


In order to define the energy-momentum tensor of the electromagnetic field, let us first 
calculate the space-time derivatives of the Lagrangian density C as follows: 

d n C = ^d n Aj + ( 3 kAj) . ( 17 . 54 ) 

Using here the electromagnetic field equations (17.34), we obtain 


or 


d n C = d k 


dc 

3 ( 9 kAj) 


d n Aj + 


dc 

3 ( 3 kAj) 


dk (dnAj) , 


( 17 . 55 ) 


d n F = d k 


dC 

3 ( 3 k A j) 


dnAj 


On the other hand, by definition we may write 


( 17 . 56 ) 
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From (17.56) and (17.57) we obtain 

3 k 


, 8£ , d n Aj - 8 k C 
3 ( 3 kAj) 1 n 


= 3 k T k = 0, 


( 17 . 58 ) 


where T k is the mixed energy-momentum tensor of the Electromagnetic field, defined by 


rpk 

' n — 


dC 


dnAj — 8 n C. 


3 (3 k Aj) 

Substituting (17.36) and (17.52) into (17.59) we obtain 


T k = _J_ F kj dnA . + -LsfrfflFn. 

Mo 


"" nj ' T 4 M n Jl ' 


The result (17.60) can be put into a more convenient form by writing 

F k id n Aj = F k > ( dnAj - djAn) + F k j dj A n , 


( 17 . 59 ) 


( 17 . 60 ) 


( 17 . 61 ) 


or 

F k Jd n Aj = F k >F n j + dj - A n djF k h ( 17 . 62 ) 

The third term on the right-hand side of (17.62) vanishes due to the result (17.53). The 
second term on the right hand of (17.61) side makes no contribution to the conservation 
law (17.58) since we have 


3 * 3 / ( F kJ A n ) = d k djV kj = 0 . 


( 17 . 63 ) 


The expression (17.63) vanishes as a product of the symmetric tensor d k d y and the tensor 
V„ that is antisymmetric with respect to its two upper indices. 

Thus, the second term does not contribute to the field invariants obtained from the 
energy-momentum tensor (17.60). Therefore, we may replace F k id n Aj by F lc i F n j in (17.60) 
to obtain the final result for the energy- momentum tensor of the electromagnetic field in 
the form 


l 

The contravariant energy-momentum 

j^nk 

j 

and the covariant energy- momentum 


-F KJ 1 


M0 


( 17 . 64 ) 


( 17 . 65 ) 


( 17 . 66 ) 


Using the differential form of the energy-momentum conservation law (17.58), we may 
write down the integral form as follows: 


f d k T nk dV=- f ^—dV+ f d a T f>a dV = 0. 
Jv c J v 3 1 Jv 


( 17 . 67 ) 
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Applying the three-dimensional Gauss theorem to the second integral in (17.67), 
we obtain 



r"° dv 


+ <j> T P a dn a = o 


(17.68) 


where n is the closed surface surrounding the volume V. If we let the field domain V grow 
to infinity ( V — >• oo), the surface integral in (17.67) vanishes and we have 


f d k T nk dV = 4- f- f T n0 
Jv dt \_c Jv 


dV 


dp n 

dr 


= 0. 


(17.69) 


Thus, we obtain the definition of the conserved energy-momentum vector of the electro- 
magnetic field 


p n = - [ T n0 dV = Constant. 
c Jv 


(17.70) 


The energy of the electromagnetic field is then 


£ = cp° = f T 00 dV, (17.71) 

Jv 

and the components of the three-dimensional momentum of the electromagnetic field are 
given by 

p a = - f T a0 dV — Constant. (17.72) 

c Jv 

In order to calculate the energy (17.71) we first use (17.65) with (17.27) to calculate 


r r 00 p® 01 Z 7 O 

1 ~ ,, r r a 

M0 


/ e 0 E 2 _ B 2 \ 

V 2 2po) 


(17.73) 


Using the definitions of the mixed and contravariant electromagnetic field tensors (16.27) 
and (16.29) respectively, we obtain 


1 ^ E 2 j (e 0 E 2 B 2 ^ 


no 


V 2 2 p 0 J 


= eo E‘ 


_ Uo&_ _ B 2 \ 


V 2 


or 


joo _ gQ E 2 + 


2mo7 ’ 

B 2 

2/ro 


(17.74) 


(17.75) 


Substituting (17.75) into (17.71) we obtain the expression for the total energy of the 
electromagnetic field 


£ = 


gp E 2 B 2 \ 

2 2pp ) 


X 


dV= / e dV, 


(17.76) 
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where e is the electromagnetic energy density given by 


e = r°° = + b2 


(17.77) 


2 2/x 0 

In order to calculate the momentum (17.72) we first use (17.65) with (17.27) to calculate 


jaO L pOjpa _ gOtO ( e °F _ 

MO 1 V 2 2 M0/ 


Using here g' y = 0, we obtain 


T’t'O _ ^_pOP g pva _ LfOp* 11 , 

Mo ^ Mo V Mo v 


Using here (16.20) in the form 


we obtain 


F° = ~~Ev, F va = -e va '“B ni , 


T m = 


cm o 


cm o 


Using here the notation D = sqE and (cmo) 1 = ceo, the result (17.81 becomes 


r u = c 


( D X . 


(17.78) 


(17.79) 


(17.80) 


(17.81) 


(17.82) 


Substituting (17.82) into (17.72) we obtain the expression for the total momentum of the 
electromagnetic field 




= VdV, 


(17.83) 


where V is the electromagnetic momentum density given by 

V = - T a0 = D x B. (17.84) 

c 

In order to calculate the spatial components of the electromagnetic energy- momentum 
tensor in terms of the three-dimensional electric field vector E and magnetic induction 
vector B, we use F k i = —pi k and (17.26) to rewrite the definition (17.64) as follows: 


'T'K 

1 n ~ ~~~ 

Mo 

1 

Mo 


FnjF ik - \h k n ( e 2 - B 2 ) 


(17.85) 


where we introduced the vector e = E/c and the tensor & k = F n jFi k to simplify the 
calculations. The contravariant electromagnetic energy-momentum tensor is then 
given by 


rfnk „nj 'T'k 1 

O J 1 

J MO 


@ nk - i g nk (e 2 - B 2 ) 


(17.86) 
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We can now use the matrix forms of the covariant and contravariant electromagnetic field 
tensors (16.25) and (16.29) respectively, to calculate the components of the tensor ©Jj 


[ef,] - 


' 0 

e x 

e y 

e z 


1 

0 

1 

£ 

- e y 

-e z 

-e x 

0 

-B z 

By 


e x 0 

-B z 

By 

-e y 

B z 

0 

~B X 


€y B Z 

0 

-B x 


-By 

B x 

0 


C Z —By 

B x 

0 

- 

e 2 


@yBz 

^zBy € Z B X 

- e x B , 

e x l 


e yB x 

-e y B z + e z B y e x — By — B 2 e x e v + B x B y e x e z + B X B Z 


- e z B , + e x B z e y e x + B y B x e 2 — B 2 — B 2 e y e z + B y B, 
—e x By + e y B x e z e x + B Z B X e z e y + B z B y e 2 — B 2 — 

e 2 (e x B) x (e x B) y (e x B) z 

—(ex B) x e 2 + B 2 — B 2 e x e y + B x B y e x e z + B x B z 
—(exB)y e y e x + B y B x e^, + B 2 — B 2 e y e z +B y B z 
-—(ex B) z e z e x + B z B x e z e y + B z B y + B 2 — B 2 

The spatial part of the tensor ©Jj is then given by the following matrix: 

r £ 2 4- B 2 — B 2 e x e y + B x B y e x e z + B x B, 

■ B 2 e y e z + B V B , 


B y 


(17.87) 




e v e x + ByB x e^ + B^ 


vyt'X T 

e z e x - I B Z B X 


uyi-z 


e z e y + B Z B V e? + B 2 — B 2 


(17.88) 


The contravariant components of the three-dimensional system (17.88) are obtained from 


or in the matrix form 

[©“^] = 




—e 2 — B 2 +B 2 —e x e y — B x B y —e x e z — B X B Z 


&y@x ^y 
@z@x B z ±j x 

From the matrix (17.90) we obtain 


ByB x 


b 2 + b 2 




B y B Z 


—e z e x — B Z B X —e z e y — B z B y —e 2 — B 2 + B 2 


QOifi = _ e a e fi _ gigf) _ g afi B 2 


E a EP 


- BBBP - g afi B 2 


(17.89) 


(17.90) 


(17.91) 


Substituting the result (17.91) into the spatial part of (17.86), we obtain 

1 


j-'Olf} 

MO 


1 

Mo 


ePeP _ - g afi B 2 - l -g ap (e 2 - B 2 ) 

dV + EBB? + ^ (« 


[e 2 +B 2 


(17.92) 


Thus, we finally obtain the spatial components of the electromagnetic energy- momentum 
tensor in terms of the three-dimensional electric field vector E and magnetic induction 
vector B, as follows: 
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T afl = -T af> , (17.93) 

where we define the Maxwell stress tensor T“^ by the equation 

T 0 ^ = eo E a E fi + — FBP + g af) (e 0 E 2 + — ) . (17.94) 

Mo V Mo ) 

Using the results (17.77) with (17.84) and (17.93) we may write the explicit matrix form of 
the electromagnetic energy- momentum tensor as follows: 


g 

V 1 

V 2 

v 3 


V 1 

-T n 

-T 12 

-r 13 

(17.95) 

V 2 

-r 21 

'Y'ZZ 

sy 23 

V 3 

-r 31 

_7^32 

_7^33 



From the definition (17.94) we see that the three-dimensional Maxwell stress tensor 
is a symmetric tensor. By examination of (17.95) we conclude that the complete four- 
dimensional electromagnetic energy- momentum tensor is also a symmetric tensor, that 
is, that we have 

T nk = T kn , T nk = T kn . (17.96) 

If the electromagnetic field is defined in a domain V with the boundary surface n, the 
expression 

F a = f d -^-dV = (f T afi drip, (17.97) 

Jv dxf> J n p 

defines the force due to the electromagnetic field pressure on the boundary surface n. 



PART 4 


General Theory of Relativity 





Gravitational Fields 


18.1 Introduction 


The special theory of relativity discussed in the last four chapters is based on the concept 
of the inertial frames of reference. An inertial frame of reference is defined as a system 
of reference where a free particle, that is, a particle on which there is no action of any 
external forces, moves along the straight line with constant velocity. On the other hand, 
the gravitational interaction, as one of the fundamental interactions in nature, is a long- 
range interaction which cannot be screened. The concept of inertial frames of reference 
is, therefore, not compatible with the gravitational phenomena. The only way to define 
an approximately inertial frame of reference is to visualize it as being far away from any 
matter. Given the influence of the force of gravity on the observations of various physical 
phenomena, both on Earth and in the universe as a whole, the concept of inertial frames of 
reference as a foundation for the formulation of the laws of nature is clearly not sufficient. 
Since the force of gravity is an unscreened long-range force, it can be considered as an 
intrinsic property of the space-time and related to the space-time geometry. 

In the absence of matter we can define the inertial frames of reference where the 
geometry of space-time is pseudo-Euclidean. The space-time metric is then given by 

d.s' 2 = gk„ dx k dx 11 — c 2 At 2 — d.t 2 — d y 2 — dz 2 , (18.1) 


such that the components of the metric tensor g mn are constant and given by the 
following matrix: 


[gm;i] — 


10 0 0 
0-100 
0 0-10 
0 0 0 -1 


(18.2) 


In the presence of matter it is not possible to find the four space-time coordinates x J such 
that the metric of the space-time manifold is reduced to the pseudo-Euclidean expression 
(18.1). Thus, the geometry of space-time in the presence of matter must be a pseudo- 
Riemannian geometry, where the components of the metric tensor g/ cn are functions of 
the coordinates xf that is, we have 

Skn = gkn(x j ) (j,k,n = 0, 1,2,3). (18.3) 


The components of the metric tensor of the given space-time manifold (18.3) can therefore 
be considered as the gravitation field potentials and the gravitational effects are described 
by the metric itself. 
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From (18.2) we note that the determinant g — — 1 of the pseudo-Euclidean metric 
tensor of special theory of relativity is a negative constant number. In the general theory 
of relativity g = g(xf < 0 is a negative definite function of space-time coordinates. In 
the ordinary positive- definite Riemannian metric spaces we have frequently used the 
function to define the absolute tensors and invariants. In the pseudo-Euclidean and 
pseudo-Riemannian space-times of the theory of relativity with g < 0, this function is 
not a real function and it is generally replaced by which is a real function. In the 
pseudo-Euclidean special theory of relativity we have = 1, so the relative tensors 
and invariants were transformed as the absolute scalars and invariants and the explicit 
appearance of the factor was not necessary. 

In the presence of gravity it is not possible to reduce the metric of the space-time 
to the pseudo-Euclidean expression (18.1) in the entire space. It is important to note 
that the gravitational effects are understood as a deviation of the space-time metric 
from the pseudo-Euclidean metric (18.1). The metric (18.3) is therefore a function of 
the local distribution of matter as a source of the gravitational field and cannot be fixed 
arbitrarily in the entire space. The metric tensor, as the gravitation field potential, is in this 
context a solution of the gravitational held equations, which will be derived in the next 
chapter. 

Although we cannot transform away the gravity in the entire space, we can select the 
local frames of reference falling freely in the gravitational held where the gravitational 
effects are locally transformed away. In a small laboratory falling freely in a gravitational 
held, the laws of nature are therefore the same as those observed in an inertial frame of 
reference in the absence of gravity. The laboratory has to be small since the gravitational 
helds are functions of coordinates and vary in magnitude and direction in different points 
in space. The force of gravity can therefore be considered as uniform and transformed 
away by a suitable choice of local frame of reference only in a sufficiently small space 
domain. 

Thus, we are in most cases able to cover the space-time with a patchwork of local 
inertial frames of reference. The concept of local inertial frames of reference is very useful 
in the general theory of relativity. 


18.2 Time Intervals and Distances 

In the general theory of relativity, the choice of the generalized coordinates used to 
describe the four-dimensional space-time manifold is not restricted in any way. Thus, 
the coordinates xJ are in general not equal to the distances and time intervals between 
events in the same way as in the special theory of relativity. Therefore, given a set of 
generalized coordinates xf we need to relate these coordinates to the actual distances and 
time intervals between the observed events. 

In the general theory of relativity we denote the proper time by r . Let us now consider 
two infinitesimally separated events which take place at exactly the same point in space. 
Then we have dx 1 = dx 2 = dx 3 = 0 and we may write 
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ds 2 = g kn dx k dx" = c 2 dr 2 = goo (dx 0 ^ • (18.4) 

Thus, we obtain the relation between the element of the proper time dr and the coordinate 
differential dx° as follows: 

dr = - c ^dx°, (18.5) 

or for the proper time between any two events occurring at the same point in space 

r = ~ c J y/goo dx°. (18.6) 

In the special theory of relativity the element d l of the spatial distance is defined as the 
distance between two infinitesimally separated events taking place at the same time. 
In the general theory of relativity we cannot use this definition, since the proper time 
is a different function of coordinates at different points in space. In order to find the 
element dl of the spatial distance, we consider two infinitesimally close points A and B 
with coordinates x J and x J + dx^ respectively. Let us suppose that a light signal is directed 
from point A to point B and then back from point B to point A along the same path. The 
time required by the light signal to travel from point A to point B and back, observed from 
one point in space and multiplied by c, is double the distance between the two points. For 
the two events representing the departure of the light signal at one point and the arrival 
at the other point we know from (13.4) that the square of the interval ds 2 is equal to zero. 
Thus, we may write 

ds 2 = g kn dx k dx 11 = goo(dx 0 ) 2 + 2 g 0a dx“ dx° + g a p dx a dx^ = 0. (18.7) 

Equation (18.7) is a quadratic equation with respect to dx° and its solutions are given by 

(dx°)± = ± — J(go a gop - googaft) dx“ dxA (18.8) 

goo goo v 

The coordinate time required by the light signal to travel from point A to point B and back 
is then given by 

(dx°)+ - (dx°)_ = —J(goagop - googap) dx" dxA (18.9) 

goo v 


Using (18.5) we obtain the proper time required by the light signal to travel from point A 
to point B and back, multiplied by c, as follows: 

cdr = y/goo [(dx°)+ - (dx°)_j 

= -^=J(gOcgop -googap) dx u dxp. ( 18 . 10 ) 

ygoo 


Thus, the spatial element dl between the two points A and B is a half of the proper time 
interval (18.10) multiplied by c, that is, 


dZ = 





( 18 . 11 ) 
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Thus, the square of the space metric d Z 2 can be written in the form 

d l 2 = Yap dx“ Ax p , Yap = -gap + g0ag0p . (18.12) 

goo 

It should be noted that the metric tensor gk n in general depends on time coordinate. 
The space metric dZ 2 is therefore also time-dependent and in general it does not make 
sense to integrate the element of the spatial distance dl, as such an integral would depend 
on the world line chosen between the two end points. Only when the metric tensor g k n 
is not time dependent and the distance can be defined over a finite portion of space, 
the integral of the element of the spatial distance dl along a space curve has a definite 
meaning. 


18.3 Particle Dynamics 

In the special theory of relativity the motion of a free particle of mass m is defined by the 
action integral (15.5) . The action integral of a free particle is simply proportional to the arc 
length in the four-dimensional space-time manifold, that is, 

C Sb f Se dx fc dx n 

I = —me / d s=-mc / \gkn~, r~ ds, (18.13) 

Js A Js a V ds 

where m is a mass parameter of the free particle. From (18.13) we see that the Lagrangian 
function is given by the expression 


L = 


-me 


gknll k U n 


(18.14) 


The equations of motion of a particle in the gravitational field are obtained by variation 
of the same action integral (18.13), since the introduction of the field of gravity is 
nothing but the change of the space-time metric to a pseudo -Riemannian metric with 
a coordinate-dependent metric tensor (18.3). On the other hand, except for the overall 
constant multiplier —me, the Lagrangian (18.14) is the same as the Lagrangian (11.14) 
used to derive the geodesic equations (1 1.25). Thus, we conclude that a free particle in the 
gravitational field moves along the geodesic lines and that the equations of motion are the 
geodesic equations (11.25), that is, 


d 2 x n dx l dx k 

i_ r n 

d.v 2 lk ds ds 


= 0 (k,l,n = 0, 1,2,3) 


(18.15) 


or 


d u n 
ds 


' 1 I 


\_u l u k = 0 


(k, l,n = 0, 1,2,3). 


(18.16) 


In order to find the non-relativistic limit of (18.16), we first note that the non-relativistic 
limit ( v < c) of the components of the four velocity (13.39) and (13.40) is given by 


u 


o 



CK 


if 1 


c 



if 1 

c ’ 


1, u l 


(18.17) 
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Thus, the leading term on the right-hand side of the expression (18.16) is the one with 
l = k = 0, such that we may write 


du' 1 „ 

_ = - r 0 " 0 (n = 0, 1,2,3). 


(18.18) 


For a static gravitational held with 3ogoo = 0, the only non-trivial equations in (18.18) are 
the spatial equations 

du a 


_ r" 

~dT ~ 00 


(a = 1,2,3). 


(18.19) 


Using here (18.17) and the zeroth-order non-relativistic approximation for the differential 
of the parameter ds c df, we further obtain 


1 _ r „ 


(a = 1,2,3). (18.20) 

Let us now recall the definition of the Christo ffel symbols of the first kind (9.28), that is, 


r k,jp ~ 


_ 1 / dgk[ dgpk _ dgjp 
~ 2 V dxP + dxJ dx k 

For a static gravitational held with 3o gj P = 0, we obtain 


(18.21) 


Ffc.oo — 


1 ( dgkO + dgQk _ 3g00 ^ _ 1 3g00 

2 dx k ' 


2 V 3x° dx° dx k ) 

Using now the dehnition of the Christoffel symbols of the second kind (9.29), that is, 

1 


(18.22) 


r n - r r«fcr, . _ 

1 jp — g 1 kjp ~ 9 g 


( , d 8p k _ d &P \ 
V 3 xP dxi dx k ) : 


we obtain 


P n rr nk T * „nk 3g00 

1 oo = g ‘t,oo = -yg —Qxk ' 


The spatial components of (18.24) give 


pa 1 „ otk 3g00 

100 ~~2 g ~dx k ' 

For a static gravitational held with 3ogoo = 0 we then obtain 

r 0Q = 


(18.23) 


(18.24) 


(18.25) 


(18.26) 


2° dxP ' 

Substituting (18.26) into (18.20), we obtain 

l^ = V^oo 
c 2 dt 2® dx? 

In order to hnd the non-relativistic approximation for the component goo of the metric 
tensor, let us consider the non-relativistic Lagrangian for a particle moving in the gravita- 
tional held dehned by the gravitational potential </>, given by 


(or, fl = 1,2,3). 


(18.27) 


1 9 

L = £k — V = -mir — m<t>, (f> = 0(x“). 


(18.28) 
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We may always add a constant term to the non-relativistic Lagrangian without affecting 
the dynamics. Thus, following the result (15.17) we add a constant —me 2 to the Lagrangian 
(18.28) and obtain 

L = —me 2 + ^mv 2 — m<p = — me 2 — ^ mg a pifv^ — m<p, (18.29) 

where g a p are the spatial components of the pseudo-Euclidean metric tensor. In the 
Descartes coordinates they are given by g a p = -8 a p. Using (18.29) we obtain the non- 
relativistic approximation to the action integral (18.13) in the form 

r tB i <j> l if dxP \ , 

r ~- mC J lA \ C+ ~e + 2 (18 - 30) 

or after some regrouping 


I = —me 



ds 


—me 



c df + -gap 



(18.31) 


From (18.31) we obtain the non-relativistic approximation for the line element ds as 
follows: 


ds « ^1 + dx° + ^g ap y dx p . 


(18.32) 


The non-relativistic approximation to the metric of the space-time manifold is then 
obtained by squaring the expression (18.32) and dropping the terms of the order v 2 /c 2 
and higher. Thus, we obtain 


ds 2 



(dx 0 ) 2 + 


<t> 


dx" 


1 + 72 )stf-dr d * / ’ 


dx° 

c 


+ \ (&>P dxP 

Here, we may drop the last term as the term of higher order in u 2 /c 2 to obtain 


(18.33) 


ds 2 ss (dx 0 ) 2 + ^1 + y ^ g a p dx" dx^. (18.34) 

Assuming that the gravitational field is relatively weak, we may use the approximations 


1 + 




(18.35) 


and 


+ ~z) Sap dx" dx^ - g a p dx" dx^. 


(18.36) 
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Substituting (18.35) and (18.36) into (18.34), we finally obtain 

ds 2 « ^1 + (dx 0 ) 2 + gap dx" dx^. 


(18.37) 


From the result (18.37) we obtain the non-relativistic approximation to the quantity goo in 
the form 


2(p 

gOO ^ 1 + ~2- 


Substituting (18.38) into (18.27), we obtain 




(18.38) 


(18.39) 


Using here the pseudo-Euclidean spatial metric g 01 ?, consistent with the result (18.37), we 
obtain in the three-dimensional vector form 

dt) 


— = -grad^>, 4> = 4>{x). 
at 


(18.40) 

If we now recall the non-relativistic result for the gravitational potential </>, we may write 

(18.41) 


GM , GM, 

<t>{x) = — => grad0 = — g— r. 


r r° 

where M is the mass of the of the source of the gravitational field, r is the three- 
dimensional radial coordinate given by 


r = -y/x 2 +y 2 +z 2 , 

and G is the gravitational constant given by 

m 3 

G= 6,67- 10“ n a • 

kgs 2 


(18.42) 


(18.43) 


Substituting (18.41) into (18.40) and multiplying by the mass of the test particle m, we 
obtain the result for the non-relativistic gravitational force between two bodies with 
masses M and m as follows: 


F G = m - = 


GMm . 


(18.44) 


Thus, in the non-relativistic limit the equations of motion of a particle (18.16) are reduced 
to the corresponding Newtonian non-relativistic equations of motion. 


18.4 Electromagnetic Field Equations 

The objective of the present section is to generalize the electromagnetic field equations, 
derived in Chapter 17, to the pseudo-Riemannian metric space in the presence of gravity. 
In the special theory of relativity the electromagnetic field tensor is defined by (16.14), 
that is, 


Fkn = ~ 3 n A k (k, n = 0, 1, 2, 3). 


(18.45) 
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As the partial derivatives do not transform as vectors in the Riemannian metric spaces, the 
expression (18.45) has to be modified to the covariant expression 

F kn = D k A n - D n A k (lc,n = 0,1, 2, 3). (18.46) 

Using here the definition of the covariant derivatives (8.24), that is, 

D k A n = d k A n - r ] nk Aj, D n A k = d n A k - r[ n Aj (18.47) 

we obtain 

Fkn = 3 k An - r j nk Aj - dnAk + T [ n Aj. (18.48) 


Using here the symmetry of the Christoffel symbols of the second kind (9.9), that is, 

r L = r L (k,m,p = 0,1, 2, 3), (18.49) 


the second and fourth terms on the right-hand side of (18.47) cancel each other and 
we reproduce the pseudo-Euclidean result (18.45). Even in the presence of gravity the 
definition of the electromagnetic field tensor (18.45) remains valid. Using the definition 
of the covariant derivative of a second-order covariant tensor (8.33), we have 


DjFkn = djF kn - r IjFin - r l n jF kh 
D n F jk = dnFjlc - r ) n Fik - r [ n F jlt 
F> k F n j = S k F n j — T l nk Fij — r l jk F nl . 

Using now the antisymmetry of the electromagnetic field tensor 

Fin = ~Fnh Fkl = -Flic, Fji = -Fy, 
and the symmetry of the Christoffel symbols of the second kind 

T~' l I-' / / T'' l r'' / T - ' l 

1 kj ~ 1 jk’ 1 nj ~ 1 jn } 1 kn ~ 1 nk’ 

we obtain from (18.50) the cyclic expression 


(18.50) 


(18.51) 


(18.52) 


DjFkn + D n Fj k + D k F n j = djF kn + d n Fj k + d k F n j = 0. 


(18.53) 


Thus, we conclude that the first pair of Maxwell equations remains unchanged in the 
presence of gravity and has the form 


dF kn 3 Fj k dF, 


+ _li^ + ll^. = 0 (j,k,n = 0, 1,2,3). 


(18.54) 


dxJ ' dx n ' dx k 

In order to derive the second pair of Maxwell equations, let us consider the interaction 
Lagrangian (17.6) in the form 


1 r dx n 

In = / a— —A n d£L (18.55) 

v c Jn dt 

The four-dimensional volume element, defined in the pseudo-Euclidean space-time of 
special theory of relativity by the equation 


dQ = dx° dV = dx° dx 1 dx 2 dx 3 


(18.56) 
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is no longer an absolute scalar. Instead of (18.56) we define the invariant volume element 
as sf^gdQ. Thus, we rewrite the interaction Lagrangian (18.55) as follows: 

•« - J c L - j < ia - 57 > 


Since the action integral (18.57) and the four-dimensional volume element gdU are 
both absolute invariants and 71, ; is a covariant four-vector, the system defined by 


/ 


n 


a c dx n 

7=f 


(18.58) 


is by definition the contravariant electromagnetic current four-vector. The temporal 
component of the four-vector (18.58) is proportional to the charge density a and is 
given by 


/ — c 




and the three-dimensional current density vector is 

r = -^=ia 


(18.59) 


(18.60) 


Given the definition (18.58) we can generalize the second pair of Maxwell equations (17.38) 
by replacing the partial derivative by the covariant derivative D k . The second pair of 
Maxwell equations then becomes 

D k F kn = mo/" (k, n = 0, 1, 2, 3). (18.61) 


Using the definition of divergence of an arbitrary tensor with respect to one of its 
contravariant indices (10.15), we obtain 

DkFkn = ^ (j-8 pkn ) = ^J n . (18-62) 

In the same way we generalize the equation of continuity (17.45) to obtain 


D„r = 


(yFgJ n ) = 0 . 


(18.63) 


y/-g dX n 

Thus, the complete system of electromagnetic field equations in the pseudo-Riemannian 
space-time manifold, which includes the gravitational effects, is given by 


dF kn dF jk dF, 


dxi dx n 

with the continuity equation 


w 

dx k 


= o, 


1 


V~g dX k 


(V^gF kn ) = M0 /" 


1 


a 


V~g d x ' 1 


(y=g/") = o. 


(18.64) 


(18.65) 


The pseudo-Riemannian four- dimensional formulation of Maxwell equations in the pres- 
ence of gravity given by (18.64) with (18.65) is quite compact and it nicely illustrates the 
general prescription for the introduction of gravity in the laws of physics. 
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In general, the laws of physics must be valid in all systems of reference and expressed as 
covariant tensor equations. The field equations of physics, which involve the derivatives 
of the field variables, must therefore be rewritten such that the ordinary derivatives are 
replaced by the covariant derivatives. Even if we are working in a flat space-time described 
by some curvilinear coordinates, we need to use the covariant derivatives if we want the 
field equations to be valid in all coordinate systems. 

If we assume that a physical object described by a set of pseudo-Euclidean equations 
does not represent an appreciable source of the gravitational field, then it does not 
significantly influence the components of the metric tensor g kn (x-i) as the potentials of 
the gravitational field. In such a case the geometry of the four-dimensional space-time 
manifold is rigidly determined by some massive external sources of the gravitational field. 
Then the effect of the physical object under consideration on the geometrical structure 
may be neglected. Under these circumstances the set of pseudo-Euclidean equations 
describing the physical object can readily be generalized to incorporate the effects of 
gravity by making the substitution 

d -» D, d k -* D k , dS2 -* v /=gdS2. (18.66) 

Thus, the prescriptions (18.66) can be used to generalize any pseudo-Euclidean equation, 
used to describe a physical object which is small in comparison to the sources of the 
gravitational field, to incorporate the gravitational effects and to be valid in all systems 
of reference. 



Gravitational Field Equations 


19.1 The Action Integral 

In Chapter 18, we have concluded that the components of the metric tensor of a given 
space-time manifold are the potentials of the field of gravity and the gravitational effects 
are described by the metric itself. The metric tensor is therefore a system of functions of 
the local distribution of matter as a source of the gravitational field and cannot be fixed 
arbitrarily in the entire space. The metric tensor as the field potential is therefore a solution 
of the gravitational field equations. The objective of the present chapter is to derive the 
gravitational field equations from the suitable action integral of the gravitational held. 
The total action integral for a system consisting of a continuous distribution of matter 
as the source of the gravitational held and the gravitational held itself is given by 

I = Iq + Im, (19.1) 

where I G is the action of the gravitational held in the empty space, where there are no held 
sources, and Im is the action describing the interaction of the matter distribution with the 
gravitational held. The action of the gravitational held in the empty space can be written 
in the form 

Ig = -[ £g {gkn, djgkn) (19.2) 

c Ja 

where C G is the invariant Lagrangian density of the gravitational held which is integrated 
over the invariant four-dimensional domain O. The action describing the interaction of 
the matter distribution with the gravitational held can be written as follows: 

1m = - f Cm (gkn> djgkn) g'-gdQ, (19.3) 

c Jn 

where Cm is the invariant Lagrangian density of the source helds. For example, if the source 
of the gravitational held is the electromagnetic held, then Cm is the Lagrangian density of 
the electromagnetic held (17.52), that is, 

Cm (gkn) = - ^g k jgnlF kn Ffl. (19.4) 

It should be noted that the electromagnetic held F kn in (19.4) is taken as the given source 
of the gravitational held and the Lagrangian Cm is varied with respect to the gravitational 
held potentials g kn - 

In order to construct the invariant Lagrangian density of the gravitational held C G , we 
note that it is a function of the metric tensor g kn and its hrst derivatives 'djgtn only. On the 
other hand, by using (9.25) in the form 
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djgkn = T n ,kj + T k,nj = gnl T kj + Skl r hj> (19.5) 

we see that the first derivatives of the metric tensor djgkn can always be expressed in 
terms of the suitable metric tensors and Christoffel symbols of the second kind. Thus, 
the invariant Lagrangian density of the gravitational field £ G can be seen as a function 
of the metric tensor g kn and the Christoffel symbols of the second kind Y 1 ,. Thus, we 
may write 


C G = C G (g kn ,r j kn ). (19.6) 

The only non-trivial tensors that can be created from the metric tensor g kn and the 
Christoffel symbols T ] kn are the curvature tensor Rj kn defined by (12.14) as follows: 


<„ = hli - 9/r{.„ + r& rj, n - 1*^, (19.7) 

and the tensors obtained by contractions of the curvature tensor, for example, the Ricci 
tensor R kn defined by (12.44) as follows: 


R kn = Knj = d Aj - + r£/ p „ - r p kn r j pj , (19.8) 

and the Ricci scalar R defined by (12.50) as R — g kn R kn - 

Thus, a suitable candidate for the invariant Lagrangian density £ G is proportional to 
the Ricci scalar R. In such a case we may choose 


Cq = — 


16tt G 


R, 


(19.9) 


where G is the gravitational constant given by (18.43). The factor of proportionality in 
(19.9) is added to ensure the correct dimensions of the physical quantities. The action 
integral (19.2) may then be written in the form 


c 3 7 

Jg = “T 7T-F / d£2. (19.10) 

16 jr G Jn 

However, from the above definitions we note that the Ricci scalar R depends not only on 
the metric tensor g kn and its first derivatives djg k n> but also on the first-order derivatives 
of the Christoffel symbols <HY 1 kn and thereby also on the second-order derivatives of 
the metric tensor. On the other hand the gravitational field equations must contain up 
to the second-order derivatives of the metric tensor as the gravitational field potential. 
As these equations are obtained by the variations of the action (19.2), the Lagrangian 
density £ G must not contain the derivatives of the metric tensor of the higher order than 
the first. 

In order to resolve this difficulty, we need to analyze the structure of the Ricci scalar R. 
If it can be shown that the terms containing the second-order derivatives of the metric 
tensor do not contribute to the variations of the action integral (19.10), then we could 
proceed using the action integral (19.10) in the derivation of the gravitational field 
equations. Since the Ricci scalar is the only available non-trivial scalar invariant created 
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using metric tensor g^ n and the Christoffel symbols of the second kind F ] kn , it is important 
to show that the above assumption is indeed valid. Let us therefore calculate the quantity 
y/—gR, as follows: 

ypg R = V-gg knR kn = V^8g knd n^ j kj 

- + v^gg kn ( r fc,ipn - r L r y 

(V-gg knr {j) - 9; {V^g^kn) - ^ (^gg kn ) ^ 

+ 9; (y=gg fcn ) r L - ( r L r it - r fc;4») • (19-11) 

Interchanging the dummy indices n o- j in the first term on the right-hand side of (19.11), 
we obtain 

7=P = 9/ (y=^ r «„ - yf-gg^in) 

+v /z ^ r Ly= 9 t (yygg^) 

-y=gg fc "(r^.-r^4„). (19.12) 

Here, we note the result (9.48) in the form 

g ]n rf n = - - (19-13) 

and we calculate 

y=9j (y=gg fc,i ) = g kn dj (lny=^) + djg kn . (19.14) 

Using here the result (9.36) in the form 

r?=9/( In y=g), U9-15) 

we obtain 

^=9; (y^g"") = g fc "r£ + djg k ’\ (19.16) 

Using further the result (8.29) applied to the metric tensor g kn and the property that the 
covariant derivative of the metric tensor is equal to zero, we have 

Djg kn = djg kn + r pjg kP + r pjg nP = 0, (19-17) 

or 

djg kn - ~ r pjg kp - r p jg np - 

Substituting (19.18) into (19.16) we obtain 


(19.18) 
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Substituting (19.13) and (19.19) into (19.12) we obtain 

= 9/ (V-gg kJr k n - J-gg kn r J k n) 

+ ^L(g kn ^ p - rn pj g kP -^ j g nP ) 

+ y^g r kp g Jn rfn - y^gg kn ( r kn rJ pj - rP k j rJ pn) H9.20) 

or after regrouping 

y=P = dj (V-gg kJr kn - y-gg kn r{n) 

+ V^g(r j kn g kn r j P + r k P g jn r f„) - y^gr[ n {vy* + r*/*) 

-y-gg^K^-^Pn)- ('9.21) 

Interchanging the dummy indices k -o- j in the second term on the right-hand side of 
(19.21), gives 

y=& = a J (y^g^L - y^g k, Kn) 

/ nrjkll (yj yP , yP yj \ _ / yj y?l „kp . yj pfc p ,np\ 

^ V 65 (‘ kn jp ^ 1 jp 1 kn) V 6 ^ foi pj5 T i fc „i p] & J 

~ y~gg kn ( r kn r pj - r kjTpri) ■ (19.22) 

Interchanging now all dummy indices in the third term on the right-hand side of (19.22), 
we obtain 

y^gR = 9/ (y-gg kj r n kn - y-gg kn r j kn) + 2 y-gg kn r L r jp 

- J-gi^npgb" + ^Apg kn ) 

-y^gg kn (^ pJ -r^pn). (19.23) 

Using now the symmetry of the Christoffel symbols of the second kind with respect to its 
two lower indices (9.9), the result (19.23) becomes 

y^gR = dj (y-gg kj r» kn - y-gg kn r{ n ) + 2 y^ g k " r[ n r p p 

- 2 y^gg kn r p kj r J pn - ^gg k " (r kn r j pj - r p kj r J pn ) (19.24) 

yj yP _ yP yj \ 

1 kn jp 1 kj pn ) 


or 


y^R = dj (y=ggv t “„ - y = g g kn rj tn ) + 2 y^ gg kn ( 

— /ZZp M ( pP p j _ y p r J 'l 

V 55 kn pj 1 kj 1 P n J ‘ 


(19.25) 
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Now we note that the second term on the right-hand side of (19.24) is twice the third term 
with a positive sign. Thus, we obtain 

= 9/ (v-gg kJr k n - v^s kn it) 

+ V=h kn ( r p k y pJ -r^ pn ). (19.26) 

The final result for the quantity ^f—gR can therefore be written in the form 

y=gR = djw> + y^gr, G9.27) 

where we define 

r = g kn ( r kn r pj ~ r kj r pn) ' ('9.28, 

and 

w j = y 7 — g (g kj V n kn - g kn r{„) . (19.29) 

Here, we note that the quantity F depends only on the metric tensor g kn and its first 
derivatives djg kn , while the terms containing the second-order derivatives of the metric 
tensor in the Ricci scalar R are collected into an expression which has the form of the 
divergence of a vector, that is, djwR Substituting (19.29) into (19.10) we obtain 

Ig = ~TR~r I 1 - 7Z~r l ¥ vi d °- 

16 n G Jn 16 nGJn 

(19.30) 

According to the Gauss theorem the second integral on the right-hand side of (19.30) can 
be transformed into an integral over a hypersurface surrounding the domain £2 over which 
the integration is carried out in the first integral. Thus, the variation of the second integral 
vanishes due to the variational principle which requires that the variations of the fields at 
the limits of the domain G are equal to zero. Consequently, we have 

SIg = “ 7 ^ — r S / Rj-gdV = / ry^gdQ. (19.31) 

16jt G J n v 16n-G 

It should be noted that the variation of the action (19.31) is a scalar invariant, although the 
integral 

- 7 ^— 7 ; [ ry^dQ, (19.32) 

16 Jr G J a 

and the quantity T defined by (19.28) are not scalar invariants. Thus, we have shown that 
the terms in the Ricci scalar R containing the second- order derivatives of the metric tensor 
do not contribute to the variations of the action integral (19.10) and that we may use the 
action integral (19.10) in the derivation of the gravitational field equations. 
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19.2 Action for Matter Fields 


The variation of the total action (19.1) of the gravitational held in the presence of matter is 
given by 


SI = SIq + SIm — o. 


(19.33) 


Thus, we need to calculate the variations SIg and SIm- In this section we consider the 
variation of the action integral of matter fields Jm, that is, 


or 


SI M = - [ 
c Jn 


SIm = - f 8 (y/—gC m ) d£2, 
C Jn 


9 h, 9 (yf—gC-M) 


dg‘ 


kn 


-Sg Kn + 


9 ( djg kn ) 


(V“) 


dfi. 


(19.34) 


(19.35) 


Integrating the second term in the integral (19.35) by parts and dropping the integral over 
the hypersurface boundary of the domain of integration £2, following the same steps as in 
the case of the electromagnetic held, we obtain 


SIm = - , 
c Jn 


9 


dg 


kn 


-di 


9 (y-g^M) 

9 ( djg kn ) 


Sg kn dQ. 


(19.36) 


Let us now dehne the energy-momentum tensor T kn of the matter helds as follows: 


2 %/ ]jTkn — 9 / 


9 (V=g£ M ) 


9 (V=^m) 


dg 


kn 


(19.37) 


(19.38) 


9 {djg kn ) 

Substituting (19.37) into (19.36) we obtain 

SIm = ~^~ [ T knSg kn % / = g dt2. 

^ c JQ. 

Using here the result S (g kn gjn) = S8 k — 0, we may write 

T kn Sg kn = gk, T lk gnjSg kn = —gkl T l jg kn Sg n j 

= -s?T l hg nj = -T n hg nj = -T kn Sg kn . 

Substituting (19.39) into (19.38) we obtain 

SIm = ^~ f T kn 8g kny pg da. 

Ji 2 

Let us now calculate the energy- momentum tensor for the electromagnetic held with Cm 
given by (19.4). In the case of the electromagnetic held the Lagrangian Cm is a function of 
the metric tensor g kn only and not of its derivatives djg kn . Thus, using (19.37) we obtain the 
contravariant energy- momentum tensor in the form 

2 9 (V=£C M ) 


(19.39) 


(19.40) 


rj->kn _ 


yj 8 dgkn 


= -2 


SCm 


2 Sy/=g 


Sgkn V g Sgkn 


Cm- 


(19.41) 
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Using here the result 

2 1 dg G kn kH ' 

\J § dgkn g dgkn g 


we obtain from (19.41) 


j 'kn 


Using now (19.4) we may write 


2 ^m 

dglcn 


g kn c M - 


9£m 

dgkn 


-J- FP q F ]l JL ( g g ) = + J_ F P‘t F li 28 * S ?g ql 

4 MO 3 glen * 4 mo p 1 ' 


( 19 . 42 ) 


( 19 . 43 ) 


= +^-F kcl F ln g q l = 

2 MO 2 mo ' 

Substituting (19.4) and (19.44) into (19.43), we obtain 

jkn _ __L pkjpn _| gknpjlp 

Mo J 4 l/.n 1 


4mo 


( 19 . 44 ) 


( 19 . 45 ) 


The result (19.45) is identical to the result (17.65) obtained by other means in the special 
theory of relativity with pseudo-Euclidean geometry. 

As a next example, let us now consider a continuous distribution of non-interacting 
particles with a total mass m and a rest-energy £o in a three-dimensional domain of 
volume V. If we denote the mass density of the matter distribution by p, we may write 


m= [ p dV, £q = f p(? dV. 
Jv Jv 


The action integral of the given matter distribution can be written in the form 


I K = —me f ds = —c [ pdV [ Jgndxjdx 1 
Js A Jv Ja v 


or 


Ik = 


-eff 

Jv Jt , 


vJt A V=gV JI dt dt 


dxi dx l , — , w , 
Sji^7^7^8dVdt. 


( 19 . 46 ) 


( 19 . 47 ) 


( 19 . 48 ) 


Introducing here a system as follows: 

p dx n 

<j>" — — 

V=g dt ' 

the action integral (19.48) can be written in the form 

h = - J C-MKyf-gdQ,, Cmk = -Cyj gji<&i4> 1 ■ 


( 19 . 49 ) 


( 19 . 50 ) 


Since Cmk is the invariant Lagrangian density, we see from (19.50) that the system O" is 
a contravariant four-vector. The temporal component of the vector <f> ,! is proportional to 
the mass density p and is given by 
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$o= pC 


v=r 


(19.51) 


The spatial components of the vector <E>" are proportional to the flux of mass through a 
unit of the boundary surface dll of the volume V in the unit of time, that is, 

d 3 m 


<j>“ = 




if = 


dn a df ’ 


(19.52) 


The definitions (19.51) and (19.52) are fully analogous to their respective electromagnetic 
counterparts (17.59) and (17.60). In analogy with the electromagnetic result (18.63) the 
mass conservation law gives the following continuity equation: 


D n <t> n = 


1 


= 0. 


*J=gdx n 

In the three-dimensional vector notation the continuity equation (19.53) has the form 


(19.53) 


— + div(py) = 0. 
dt v ’ 


(19.54) 


Integrating over the volume of the three-dimensional domain V and using the Gauss 
theorem, we obtain the integral form of the mass conservation law 

d 


df 


[ pdV = f 

Jv Jr. 


= / pv ■ dn. 


(19.55) 


According to the mass conservation law (19.55), the negative increment of mass m within 
a three-dimensional volume V is equal to the total mass flux through the boundary surface 
n of the volume V in the unit of time. Using now (19.48), we may write 


yPgCMK = -pc^Jgjiuiu 1 = -pc|. 


(19.56) 


Using (19.41) we obtain the energy- momentum tensor of the given mass distribution as 
follows: 


j'kn _ 


2 3 (V~S^mk) 


p c ds l. „ 
-u k u n 


(19.57) 


N/=g 3 gkn dt 

In the pseudo-Euclidean case we note that the component T 00 gives the correct result for 
the energy density 


r j -< 00 


pc 


/ 


1 - — 
1 .2 


and the components T u " give the correct result for the momentum density 


rpOa 


p if 


(19.58) 


(19.59) 


For a weak gravitational field at low velocities, the following limits can be used: 

((iky = )yy() * 0, Safi * ^afi , 


(19.60) 
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and 


ds -*■ y/, gooc d t, 

Substituting (19.61) into (19.57), we obtain 


V~s Vsoo- 


T’kn „ 

1 p C ll hi . 


(19.61) 


(19.62) 


The energy- momentum tensor (19.62) can be considered as the kinetic energy- 
momentum tensor which does not include the contribution from the internal energy 
of the given matter distribution. In order to include the contribution from the internal 
energy of the matter distribution, we may use the result for the differential of the internal 
energy dU = —pdV, where p is the pressure within the matter distribution. Thus, we 
obtain the action integral corresponding to the internal energy in the form 

n tB 


lu = - 


f Udt =- ( C M Uy/^g d£2, 
J t A C J £2 


where the Lagrangian density Cmu is given by 


„ d U 

CMU = ~dv =p - 


(19.63) 


(19.64) 


As the Lagrangian density Cmu does not include any dependence on the metric tensor g kn 
we use the constraint 


1 - Jgjiidu 1 = 0 

to create a Lagrangian function/ = f(u k , X) with the constraint 

f(u k ,xj = pjgjiuhi 1 till— ^jgjiui" 1 


(19.65) 


(19.66) 


Using the method of Lagrange multipliers the equations for the parameter X and the 
variables u k are given by 


^ = 1 
dx 


gjiuhi 1 = 0 => Jgjiuiu 1 = 1, 


and 


df 1 / — 

rr = 7 (P “ A ) Jsjiuiu l 2g k jui = o => k = p. 


(19.67) 


(19.68) 


du k 2 

Using (19.68) we obtain the suitable Lagrangian density for the calculation of the energy- 


momentum tensor 


Cmu = P + P I 1 - Jgjiu ] w = p. 


(19.69) 


The contribution to the energy- momentum tensor from the Lagrangian density (19.69) is 
obtained using (19.43) as follows: 


r rkl2 n 

1 (U) - z 


d§kn 


g kn C M = p( 


u k u n 


y kn\ 

5 )• 


(19.70) 
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where we used (19.69) to calculate 


9 Cm 
dgkn 


k 1t n 


? -y]gjlU j U l 


u u 


1 1* n 

pu U . 

2 y 


(19.71) 


Putting together the results (19.62) and (19.70), we obtain the total energy- momentum 
tensor of a matter distribution in the form 


rfkn rpicn I rjH 

1 — 1 (K) ■+■ 1 ( 


•kn 


rkn 


= (p + pc^j U k l 


-g kn p. 


(19.72) 


The result (19.72) is the most commonly used form of the energy-momentum tensor 
of a matter distribution in the general theory of relativity. It should be noted that the 
quantity T 00 is always positive, and in most practical calculations the contribution from 
the pressure terms can be neglected. 


19.3 Einstein Field Equations 

In this section, we consider the variation of the action integral I G in (19.33), which 
describes the gravitational field itself, and derive the gravitational field equations. Using 
the result (19.31) we have 

«G = f R/^dQ. (19.73) 

16? tG J q 

In order to calculate the variation (19.73) we need to calculate 

<5 (V~g R ) = 5 y~gg knR kn) 

= 9 (V^g) R + J^gS g knR kn + g kn SR kn . (19.74) 

Using here the results (9.35) and (9.40), we may write 

8g = gg^Sgkn = —ggkn 8g kn > (19.75) 

and we have 

H-J-=S) = - I j=Sg = - I j=(-ig t ,,Sg 1 ’). (19.76) 

or 

9 C/=g) = - ^ gknSg kn ■ (19.77) 

Substituting (19.77) into (19.74), we obtain 

9 (yFgR) = ( R kn - \gknRj 8g kn + ^gg kn 8R kn . (19.78) 

Next, we calculate the expression g kn &R kn as follows: 

g kn 8Rkn - g kn 8 (a, 4 . + dj r j kn + r p kj r J pn - r p kn r j pj ) 

= g kn 8 (dnr j k] - 4 „) - g kn sr p kn r j pj - g kn r p kn sr j pj 
+ g kn 8T p k y pn + g kn r£«r£„ + g kn r J nj 8r p kp - g kn r j nj 8r p p 


(19.79) 
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where we have added the term 

s kn Kj sr kp - S^njKp = 0 n9.»0) 

at the end of (19.78). Regrouping the terms, (19.79) becomes 

g kn sR kn = g kn s (dnr j kj - djr j kn ) + g k » (r j nj sr p p - r j pj sr p kn ) 

+ g kn r IjKn + g knsr kj r pn - g k 'Kr Srj pj ~ S^njKp' d9.81) 

Renaming the dummy indices, we obtain 

g kn sR kn = g kn s (d n r p kp - d p rQ + g k » (r l nl sr p p - r l pl sr p n ) - 

+ g kllr ' kp <, + g kn K P r ni ~ g knr l srl P i ~ g knrl m srP k p d 9 - 82 ) 

or 

g kn sR kn = g k "s (d n r p kp - d P r p n )+g kn (r l nl sr p p - r l pl sr p n ) - 

+ g Jnr jp Sr nk + g kir jp sr kn - g jnr fn^ kp ~ g^jn^lp- d9.83) 

Using the symmetry of the Christoffel symbols with respect to the lower two indices, we 
may rewrite (19.83) as follows: 

g kn 8R kn = g kn 8 (d n r p kp - dp r p n ) + g kn (r l nl sr p p - r^r*,) 

- (~g ]n rf p - g kj : r? ) sr p kn + (-g^rf n - g^r» ) sr p kp . (19.84) 

From the results D p g kn = 0 and D n g kn = 0, we have 

8pg kn = -r k p g Jn - rj pg kj , 

d n g kn = -r k n g> n - r f n g k C (19.85) 

Substituting (19.85) into (19.84) we obtain 

g kn 8R kn = g kn d n (ary - g kn d p (<„) 

- K,M kn + Kp d ng kn + g kn { r ni sr kp - r pl<n) (19.86) 

or 

g kn SR kn = d n ( g kn 8F P kp ) - d p ( g kn sr p n ) 

+ r hl {g knsr k p) - rl pl {g knsr l) ■ (19.87) 

Interchanging the dummy indices k o- p in the first and third term on the right-hand side 
of (19.87), we obtain 

g kn 8R kn = d P (g kp sr kn - g kn sr kn ) 

+ Kl(s kPSr kn-g k ' lSr kn) 


(19.88) 
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Introducing here a four- vector a/’ as follows: 

of = g kp sr^ n - g kn &r p kn 

and using F l pl = d p In the result (19.88) becomes 


(19.89) 



(19.90) 


or 


g<™ S R kn = ' dp (^a>P). 


(19.91) 


Substituting (19.91) into the result (19.78) we obtain 


8 (V-g R ) = V-g ( R hi - \ gknRj 8 g kn + dp (T 3 g " P ) 


(19.92) 


Substituting further (19.92) into (19.73) we obtain 





(19.93) 


The second integral in (19.93) can, by means of the Gauss theorem, be transformed into 
an integral of a/’ over the hypersurface surrounding the entire four-dimensional domain 
Q. When we vary the action Iq, the variation of the second integral vanishes due to the 
variational principle which requires that the variations of the fields at the limits of the 
domain £2 are equal to zero. Thus, the second integral in (19.93) can be dropped and we 
obtain the final result for the variation of the action integral for the gravitational field in 
the form 



(19.94) 


Using now the result (19.33), that is, — SI G = SIm with (19.38) and (19.94), we obtain 




(19.95) 


Since the variations of the metric tensor are arbitrary, the result (19.95) gives Einstein field 
equations for the gravitational fields in the form 


R kn 2 gkn R — ^4 R kn- 


( 19 . 96 ) 


Chapter 19 • Gravitational Field Equations 1 69 


The field equations (19.96) with mixed tensors iff, and T„ are given by 


p/C c/C p 

^ n 2 ~ 


8 7T G 


(19.97) 


Contracting (19.97), we obtain 


R"--8"R=R-2R=-R=-^-^T. (19.98) 

" 2 " c 4 

Substituting (19.98) into (19.96), we obtain an alternative form for the gravitational held 
equations 


Rkn — 


8n G 


(r kn - ^gk n Tj ■ 


(19.99) 


In empty space we have T kn = 0 and (19.99) give R kn — 0- The result Rkn — 0, however, does 
not mean that the space-time is flat (i.e., pseudo -Euclidean). The condition for the space- 
time to be pseudo-Euclidean is that the curvature tensor RL is identically equal to zero, 

thatis - Kin = °- 



Solutions of Field Equations 



In Chapter 19, we have derived the gravitational field equations. The gravitational field 
equations are non-linear equations and they include the self-interaction, that is, the 
interaction of the gravitational field with itself. The principle of superposition, which 
is valid for the electromagnetic fields in the special theory of relativity, is therefore not 
valid for the gravitational fields. However, in most practical cases we work with weak 
gravitational fields for which the field equations can be linearized and the principle of 
superposition is approximately valid. In this chapter we discuss the non-relativistic limit 
of the gravitational field equations and the simplest solution of the complete gravitational 
field equations for the static spherically symmetric field produced by a spherical mass M at 
rest, known as the Schwarzschild solution. The Schwarzschild solution has played a major 
role in the early development of the general theory of relativity and is still regarded as a 
solution of fundamental importance. 

20.1 The Newton Law 

In order to find the non-relativistic limit of the gravitational field equations (19.96) or 
(19.99), we start with the definition of the Ricci tensor (19.8), that is, 

R kn = dnr J kj - djr{ n + r p kj r j pn - rj^.. (20.i) 

In the non-relativistic limit in the static gravitational field, with the approximate metric 
given by (18.37), the only non-trivial equation (20.1) is the one with k = n = 0. Thus, we 
calculate 

Roo = 9 o^ 7 . - 3,rj 'o + r^rjo - r p 0 r j pj « -d a r a 00 . (20.2) 

For the static gravitational field with dogkn = 0 we may use the approximation (18.26) with 
(18.38), such that we have 

roo = -^Voo = -^9> (20.3) 

Substituting (20.3) into (20.2), we obtain 

Roo « -a«r^ 0 = \d a d a <t>. (20.4) 

From the result for the energy-momentum tensor of a matter distribution (19.72), in the 
static non-relativistic case under consideration, we obtain 

Too ~ pc 2 uqUq « pc 2 , T ~ pc 2 g kn Uk u n = pc 2 . (20.5) 
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Thus, we obtain 

Too - -gooT = -pc 2 . (20.6) 

Substituting the results (20.4) and (20.6) into the gravitational field equation (19.99) with 
k = n = 0, we obtain 


or 


Roo = -y d a d a (j> = -T— 


(^Too - -goo T^ = 


An G 


(20.7) 


-d a d a (t> = V 2 4> = AnGp. 


( 20 . 8 ) 


Thus, we obtain the non-relativistic gravitational field equation for the gravitational 
potential </> in the form 

V 2 0 = 4 nGp. (20.9) 

The solution of (20.9) is given by 

0 (?) = ~G f (20.10) 

Jv ft 

where R is the three-dimensional distance between the position of the sources and the 
position where the potential is calculated, that is, 


R = J |je" -x“| 2 . (20.11) 

For a uniform mass distribution over the volume V, we obtain 

r (2012) 

The result (20.12) agrees with the result (18.41) which leads to the Newton law of gravity 
(18.44). Thus, the gravitational field equations (19.96) or (19.99) have the correct non- 
relativistic limit (20.9) which leads to the Newton law of gravity. 


20.2 The Schwarzschild Solution 

The Schwarzschild solution is a solution of the complete gravitational field equations for 
a static spherically symmetric field produced by a spherical mass M at rest. The static 
condition requires that all the components of the metric tensor g are independent on x° 
or time t. Furthermore, we must have g a o = go a = 0. The spherical symmetry suggests the 
choice of the spatial coordinates as the three-dimensional spherical coordinates (r, 9, <p). 
The most general spherically symmetric metric satisfying the above conditions can be 
written in the form 


ds 2 = e 2l ’ (r) c 2 df 2 — e 21(r) dr 2 — r 2 (&6 2 + sin 2 0d<p 2 \ 


(20.13) 
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where v = v(r) and a = a ( r) are yet unspecified functions of the radial coordinate 
r. In the metric (20.13) we are using the exponential functions in order to secure the 
right signature of all terms. Thus, the covariant metric tensor of a static and spherically 
symmetric gravitational field is given by 


[gmn] — 


e 2v 

0 

0 

0 


0 0 0 

-e 2 ^ 0 0 

0 -r 2 0 

0 0 — r 2 sin 2 9 


(20.14) 


The contravariant metric tensor is then given by 


[g mn ] = 


e~ 2v 0 0 0 

0 -e~ 2x 0 0 

0 0 — 2 0 

0 0 0 —r~ 2 sin~ 2 9 


(20.15) 


The Christoffel symbols of the first kind for the metric (20. 13) can now be calculated using 
the definition (9.28), that is, 


(20.16) 


_ 1 / dgjfc dgnj dgkn ^ 

jM - 2 ^ dx n + dx k dx j 

The results for the Christoffel symbols of the first kind for the metric (20.13) are summa- 
rized in the following list: 


Fo.oo = -Oogoo + 9ogoo - 3ogoo) = 0, 

To, oi = r 0i i 0 = -Oj-goo + 3ogio - 3ogoi) = v'(r) exp(2v), 

1 

F 0,02 = To , 20 = -(do goo + 3og20 - 3og02) = 0, 

1 

To, 03 = To, 30 = -Opg 00 + 3og30 - 3og03) = 0, 

1 

Fo.ll = —13,^01 + 3;-gl0 - 3ogll) = 0, 

1 

To, 12 = F o ,21 = -Oegoi + 3rg20 - 3ogl2) = 0, 

To, 13 = To, 31 = -Opgoi + 9rg30 — 9ogl3) = 0, 

F 0,22 = -(3eg02 + 3flg20 - 3og22) = o, 

1 

To, 23 = To, 32 = -(3pg02 + 9flg30 - 9og23) = 0 

r 0,33 = “(3^03 + 9ipg30 - 9og33) = 0 

Fi.oo = -Oogio + 3ogoi - 3 r goo) = — v'(r) exp(2v), 


r 1,01 = r U o = -Orgio + 3ogll - 3 r goi) = o, 
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T 1,02 = Ti ,20 = -( 3 flgl 0 + 9 og 21 - d r g 02 ) = 0 , 

Tl,03 = Ti, 30 = -O.pglO + 9og31 - Srgto) = 0, 

r 1,11 = -Orgll + drgu - 9rgl 1 ) = — A.'(r) exp(2A), 

1 

T 1,12 = IT, 21 = -(3flgll + drg21 - drg 12) = 0, 

1 

T 1,13 = T 1,31 = -(3,,gll + 9 r g31 - 9 r gl3> = 0 , 

1 

T 1,22 = -Oegi 2 + degzi ~ 9 r g 22 ) = r, 

1 

T 1,23 = T 1,32 = -(93gl2 + 9 flg 3 l - 9 r g23), 

1 • ? 

T 1,33 = -0<pgi3 + 9^g3i - 9rg33) = rsin 6, 

1 

T 2.00 = -Oog 20 + 9 og 02 - 3 flg 00 ) = 0 , 

1 

T 2.01 = r 2 ,10 = -(3|g20 + 9ogl2 - 9flgoi) = 0, 

1 

T 2.02 = T 2.20 = -( 9 flg 20 + 9 og 22 - 9 eg 02 > = 0 , 

1 

T2.03 = T2.30 = -(d<pg20 + 9og32 - 9eg03> = 0, 

1 

T 2 .ll = “Org21 + 3rgl2 — 3egll) = 0, 

1 

T 2.12 = T 2.21 = -(3flg21 + d r g 22 — dggiz) = — r, 
T2.13 = T2.31 = — (3<pg21 + 9rg32 - 9flgl3) = 0, 

T 2.22 = “Oeg 22 + 9flg22 - 9flg22> = 0 , 

1 

T2.23 = T2.32 = “(3^g22 + 9flg32 - 9flg23) = 0, 

1 ? • 

T2.33 = -(9,?g23 + 9pg32 - 9flg33) = r sm<9 COS0, 

1 

T 3,00 = -Oog30 + 9og03 - d<pg 00 ) = 0 , 

T3.01 = T 3,10 = — (9,g30 + 9 ogl 3 — 9,?goi) = 0, 

T 3.02 = T 3,20 = -(9flg30 + 9og23 - 9 ^g 02 > = 0, 

T3.03 = T3,30 = — (9<pg30 + 9og33 - 9^g03) = 0, 

1 

T 3.11 = -Org3i + 9rgi3 - 9^>gn) = 0, 

1 

T3.12 = T3,2l = -(9flg31 + drg23 - dygiz) = 0, 
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1 . o 

r 3,13 = r 3 ,3i = — (9pg3i + 9 r g33 - d(fgl3) = —7" Sin 9, 

F; 3,22 = -(9flg32 + 9flg23 - 9pg22) = 0, 

1 2 • 
r 3 ,23 = T3,32 = -(9pg32 + 9flg33 - 9<pg23) = -r smS COS0, 

F 3,33 = “(9pg33 + d<pg33 ~ 9<pg33) = 0. (20.17) 

The Christoffel symbols of the second kind for the metric (20.13) can be calculated using 
the definition (9.29), that is, 

Fj to = g Pjr Jte- (20-18) 

The results for the Christoffel symbols of the second kind for the metric (20.13) are 
summarized in the following list: 

F 0 °0 = g° ; F/,oo = g 00 r 0 ,00 = 0, 

F 0 °1 = r?o = g 0J r m = g 00 r 0 ,0i = i/(r), 

r 02 = r 20 = g 0J rj , 02 = g 00 r 0 ,02 = 0 , 

r o3 = r 30 = g 0]r j, 03 = g 00 r 0 ,03 = o, 

F?1 = g 0 J rj,u = g 00 r 0 ,ii = o, 

F?2 = r^=g%M2 = g 00 r 0 ,i2 = o, 
r i3 = r 31 = 13 = g°°Fo,13 = 0, 

r 2 °2 = g 0j rj ,22 = g 00 r 0 ,22 = o, 

r 23 = r 32 = g° J rj,23 = g°°Fo,23 = 0, 

F 3 °3 = g° Jr j,33 = g 00 Fo,33 = 0, 

r oo = g 1; F/,oo = g u ri,oo = v'(r) exp(2v - 2k), 

r oi = r } 0 =g 1 - / r ; - j01 = g n r li01 = o, 
r o2 = rio = g lj r 7i02 = g 11 r lj02 = o, 

F03 = F 3 1 0=g U r 7 -,03=g 11 Fl,03 = 0, 

rJi =g 1 J F i , 1 i =g n r ul = X'(r), 
r i2 = r 2i =g lj r ; - >12 = g n r u2 = o, 
f } 3 = F 3 1 ! =g 1 J r ; - jl3 = g 11 r 1 ,i 3 = o, 

r 2 1 2 = g lj r 7 ,22 - g u ri,22 = — rexp(— 2 A), 

r 23 = F 32 = g y r ; - 23 = g 11 r 1 ,23 = 0 , 
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r 33 = S 1]r j , 33 = g U Ti,33 = ~r sin 2 0 exp(-2V), 
Too = g 2j E/, oo = g 22 r 2 ,oo = 0 , 

r oi = r io = g 2Jr j,oi = g 22 E 2 ,oi = 0, 
r 0 2 = r 20 = g 2jr l,0 2 = g 22 E 2 ,02 = 0, 
r 03 = r 30 = g 2jr l,03 = g 22 r 2 ,03 = o, 

r?i = g 2] rj,u = g 22 r 2 ,n = o 
r? 2 = rf, = g 2j r j:12 = g 22 r 2>12 = 1 

r i3 = r 31 = g 2j F;,13 = g 22 r 2 ,13 = 0, 

r| 2 = g 2; r Jl22 = g 22 r 2 , 22 = o, 

r 23 = r 32 = g 2jr j,23 = g 22 E 2 ,23 = 0, 

r 33 = g 2jr /,33 = g 22 r 2 , 33 = - sin£» COS0 
rgo = g 3j r./,oo = g 33 r 3 ,oo = o, 
r 01 = r i0 = g 3j T;,01 = g 33 r 3 ,01 = 0, 
r 02 = r 20 = g 3j T;,02 = g^.CB = 0, 
r 03 = r 30 = g 3ir j,03 = g 33 r 3 ,03 = 0, 

r?! = g 3; r jiU = g 33 r 3 ,n = o, 

r ?2 = r 21 = g 3jr j,U = g 33 F 3 , 12 = 0, 
r i3 = r 31 = g 3ir j,13 = g 33 E 3 ,13 = 

r 22 = g 3; r 7i22 = g 33 r 3 , 22 = o, 

r 23 = r 32 = g 3j ' r l,2 3 = g 33 E 3 ,23 = ^ = “tfl, 

r 33 = g 3; r 7l33 = g 33 r 3 , 33 = o. 


From the results listed in (20.19) we can make the following conclusions: 


[ r o“J = [r?„] 


[ r °J = [ r ^] = 


1 10 1 20 1 30 

p2 p 2 p2 

1 10 1 20 1 30 
p3 p3 p3 
1 10 1 20 1 30 -I 

pO pO pO — | 
1 11 1 12 1 13 
pO pO pO 
1 21 1 22 1 23 
pO pO pO 
1 31 1 32 1 33 


EE 0 , 


EE 0 , 


( 20 . 19 ) 


(20.20) 


( 20 . 21 ) 
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where [a, /3 = 1, 2, 3). Furthermore, we can calculate 

*0 / = r 0 °0 + *01 + *02 + *03 = °> 


3; ~ 1 30 _r 1 31 

Using the results (20.22) we note that 


*1/ - *10 + *11 + *12 + *13 - v ' + 

*2 j = *20 + *21 + *22 + *23 = cot *’ 

f = rS n + rl, + rf, + Ton = o. 


M - 


1 

QJ 

9 4 

9^4' " 


' 9 r r4 

0 

0 

9« r 4 

9« r 4 

9fl r 4 

= 

0 

% 

0 

. 9,r [j 

9^4 

9. r 4 . 


0 

0 

0 _ 


or 


d P r ij = 0 foiaitp. 


From the results listed in (20.19) we also have 

*12 = *i 3 = *23 = 0 =► 9 r r Ip = 0 for of/ft 


w _ 1 
1 12 — Z’ 


r r3 = r 23 = ° =► %r^ = o fora jtp, 


*f 2 = *?3 = *23 = 0 =» = 0 for “ + P 


(20.22) 


(20.23) 


(20.24) 


(20.25) 


or 


= 0 fora / /3. (20.26) 

The Ricci tensor for the metric (20.13) can be calculated using the definition (19.8), that is, 

R kn = 9„r(. - djT[ n + r p r ] pn - r p C pj . (20.27) 

From the result (12.48) we see that the Ricci tensor is a symmetric tensor Rp,, = R n p and 
that it has only ten independent components. The formulae for the components of the 
Ricci tensor for the metric (20.13) are summarized in the following list: 

Roo = 9 0 r^. - 9 /Fqq + r p Q] r J pQ - r p 0 r J pj , 

R01 = *10 = d r r J oj - 9 ;^ + * 4*4 - *4 * 4 - 

R02 = *20 = 9^4- - 9^4 + r p r j p2 - r p 2 r J pj , 

*03 = *30 = 9,r 4 - 9 jt 4 + rP.r^ - r^, 

*11 = 9 r r 4 - 9 ;r ; n + *4*4 - r* r j pj , 

*12 = *21 = do Ty - 9;-r{ 2 + rf.r; j2 - rf 2 r^, 

*13 = *31 = 9,r{. - dj-r { 3 + r p T J p3 - r^, 
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R 22 = 9*4 - 9,-ri + r p r ] p2 - r p 22 r J pj , 

r 23 = /,- 32 = 9 ,r{. - 9 ; t{ 3 + r p r J p3 - r p 23 r j pjt 

r 33 = 9 ^. - 9 jt { 3 + r 3 j -rj , 3 - r^rG. (20.28) 

The results for all ten individual components of the Ricci tensor for the metric (20.13) can 
be obtained from the above list (20.28) . However, the calculation process can be facilitated 
using some of the general results (20.20) -(20.26). Thus, for a = 1, 2, 3 we may use (20.27) 
to calculate 

R a0 = 9 0 r ij - djr j a0 + r p aJ r J p0 - r^.. ( 20 . 29 ) 

Using here the condition for the static metric do I = 0, we obtain 

Rao = -9^rf 0 - r® 0 r^. - rf 0 r j pj + r p aj r j pQ . ( 20 . 30 ) 

Using here T^ 0 = 0 and r^. = 0 from (20.20) and (20.22) respectively , the result (20.30) 
becomes 

R a0 = r p .r£ 0 = r° 0 rg 0 + r°,rg, + rf 0 r° 0 + r^r“ 0 . ( 20 . 31 ) 

Using here I 'g () = 0, T^ 0 = 0 and = 0 from (20.19), (20.20) and (20.21) respectively, we 
see that all four terms on the right-hand side of (20.31) vanish. Thus, we obtain 

R a 0 = R 0a = 0 (a = 1, 2, 3). (20.32) 


Furthermore, for a, p = 1,2,3 and a / ft, we may use (20.27) to calculate 

R aP - dp r ^ - djiip + r p aj r ] pfi - r p p r j pj . (20.33) 

From (20.24) and (20.26) we see that the first two terms on the right-hand side of (20.33) 
for a ^ p vanish. Using (20.22) we then obtain 


D T-'P T-'J pO T-'J pi p./ p 2 p./ 

~ 1 a] 1 pP 1 ap l 0j 1 ap l lj 1 ap 1 2j‘ 


(20.34) 


Using here = 0 and T ^ = 0 for a / p from (20.21) and (20.25) respectively, we have 




Rap = r p ; r{ 




cote 


aj pP a 


(20.35) 


Thus, the second term on the right-hand side of (20.35) is not equal to zero only for 
( ap ) = (12). Let us now calculate the first term on the right-hand side of (20.35), as follows: 


pP | '/ pF pU pF p 

1 aj pP 1 aO 1 pP ' 1 aajA 


V p0 


^P ptn 

PP’ 


(20.36) 


or 


T-'P T-'J p0 p0 I p(T p0 I pO phi I p(T t-'CO 

1 aj 1 pP ~ 1 a0 A 0 P ' 1 a0 A ap ' 1 aco 1 Op ' 1 aco L op' 


(20.37) 


From (20.19) we see that the only non-zero term of type T^ 0 or T^ is rfg = v\r). Thus, for 
a / p the first term on the right-hand side of (20.37) is zero. Furthermore, using (20.20) 
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and (20.21), we see that the second and third term on the right-hand side of (20.37) are 
also equal to zero. Thus, we have 

Y'P pi per p&> pi pet) . p 2 t-i&) , t-i 3 p&) 

1 aj L p 0 ~ 1 aco 1 <J 0 ~ 1 aw L 1 0 1 aox v 20 1 cuo 1 30 

r l pi I pi p 2 i pi p3 I p2 pi I p2 p2 i p2 p3 

O'! 1 10 • 1 rv?. 1 lfl I 1 ct\ 3 1 lfl I 1 ^1 1 ?./? T- 1 „?! ?fl T- 1 9 


a 2 A 10 1 « 3 A 10 


al A 20 


a2 1 20 I" 1 a3 A 20 


■ p 3 pi I p 3 p 2 i p 3 p 3 
1 al l 30 + 1 a 2 l 30 + 1 t* 3 A 30 - 


Using the result (20.38) we can show by direct calculation that 


pi el p2 p3 p3 cl e2 

1 aj l pfi ~ °ct°p l 13 1 32 — °a b p 


COt 6 


Substituting (20.39) into (20.35) we finally obtain 

Rap = Rpa = 0, (a 7 ^ Ph 


(20.38) 


(20.39) 


(20.40) 


From the results (20.32) and (20.40) we conclude that all off-diagonal components of the 
Ricci tensor are identically equal to zero 


Rkn — Rnk — 0 (k ^ n). 


(20.41) 


Thus, the only nontrivial components of the Ricci tensor are the diagonal components. 
Using the static property of the metric do^ J kn = 0 and the results ( 20 . 20 ) -( 20 . 26 ), they can 
be calculated as 

Roo = -3 r ri 0 -r 0 1 0 r{. + 2ri 0 r? 0 

= v" + v'x’ — v' 2 exp(2v — 2/) 

Rn = d r r{j - - rhrij + (r? 0 ) 2 + (r^) 2 

+ (rL) 2 + ( r i 3 ) Z = - vT + v' 2 - h' 

r 22 = a 0 r J 2j - d r r\ 2 + 2r’ 2 r 2 2 + (rf 3 ) 2 - r’ 2 r 2 . 

= (l + rv' — rX r ) exp(— 2X) — 1 

J? 33 = -9 r r^ 3 - a fl r 2 3 - r’ 3 r 2 . - r| 3 r{. + 21^3 

+ 2r 33 r 23 = sin2 S [(1 + rv' - rX') exp(-2X ) - l] (20.42) 

From the results ( 20 . 42 ) we see that i? 33 = R 2 2 sin 2 0. As sin 2 f) is in general different 
from zero, substitution of these two components of the Ricci tensor into the vacuum 
gravitational field equations R kn = 0 gives the same differential equation for v and X. Thus, 
the vacuum gravitational field equations R kn = 0 give only three independent equations 
for v and X, that is, 

-v" + v’x' - v' 2 - -v' = 0, 
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v" - v'x! + v' 2 - -X' = 0, 
r 

(1 + rv' - rX') exp(— 2A) = 1. (20.43) 

Adding together the first two equations in (20.43) gives 

2 d 

-- (v' + V) = 0 => -f- (v + M = 0. (20.44) 

r ’ dr 

From (20.44) we see that the quantity v+X must be a constant. On the other hand, for large 
values of radial coordinate r the space-time is approximately flat and both v and X tend to 
zero as r -> oo. Thus, the constant v + X must be equal to zero, and we have 

v + A = 0 => X — —v. (20.45) 


Substituting (20.45) into the third of (20.43), we obtain 

(l + 2rv') exp(2v) = [rexp(2v)] = 1. 
Integrating (20.46) we obtain 

rexp(2v) = r — r^, 


(20.46) 


(20.47) 


where the integration constant tq is called the gravitational radius of the body. Thus, we 
obtain 

r G 

goo = exp(2v) = 1 , 

gn = exp(2G = (l - 1 . (20.48) 

The gravitational radius of the body is defined using the Newtonian limit (18.38) with 

(20.12) as 


goo 


2 <p 

1 +-£ = 1 - 


2GM 
c 2 r 


Comparing (20.49) with the first of (20.48) we find 

2 GM 

r G = — 2~ ■ 

C z 

Thus, the final result for the Schwarzschild space-time metric is given by 
ds 2 = ^1 — — j c 2 df 2 — (l — — j dr 2 — r 2 (&0 2 + sin 2 0di p 2 ^ . 


(20.49) 


(20.50) 


(20.51) 


From the Schwarzschild solution (20.51) we conclude that the empty space outside a 
spherically symmetric distribution of matter can be described by a static metric. Fur- 
thermore, it should be noted that the solution (20.51) is also valid for moving masses as 
long as the motion preserves the required symmetry, for example, a centrally symmetric 
pulsation. We note also that the metric (20.51) depends only on the total mass of the 
body which is a source of the gravitational field, just as in the case of the Newtonian 
theory. If we put dt — 0 in the metric (20.51) we obtain the three-dimensional space with a 
line element 
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dZ 2 = ^1 — — ^ dr 2 + r 2 (d9 2 + sin 2 6*di p 2 ^ . (20.52) 

As the Schwarzschild metric tensor g/ at is not time dependent, the distances can be defined 
over a finite portion of space and the integral of the element of the spatial distance dl along 
a space curve has a definite meaning. In other words it is possible to split the space-time 
into the space and time with definite meaning. If we can turn the mass M of the source 
down to zero, the metric (20.51) reduces to the pseudo-Euclidean metric of the flat space- 
time in the spherical coordinates 

ds 2 = c 2 dt 2 - dr 2 - r 2 (d 9 2 + sin 2 9d<p 2 ) . (20.53) 

Turning the M on again, we introduce a distortion into both the four-dimensional space- 
time continuum and the three-dimensional space itself and neither of them is flat any 
more. The level of distortion is proportional to the dimensionless quantity r G /r. In the flat 
space-time described by the metric (20.53) the radial coordinate r is the measure of the 
radial distance from the origin of the coordinates. In the curved space-time it is no longer 
the case and r is just a space coordinate which does not measure the radial distance from 
the origin. From the line element (20.52) we obtain the square of the infinitesimal radial 
distance d R for do = dtp = 0, as follows: 

dR 2 = (l - — ) 1 di 2 . (20.54) 

Thus, we see that the actual radial distance increment dR is larger than the coordinate 
differential dr, that is, d R > dr. The distance between two points r\ and r 2 is then 
obtained as 



= |y r(r - r G ) + r G ln(Vr + -Jr - r G )]^ >r 2 - n. (20.55) 

From the result (20.55) it can be shown that for small r G /r the distance between the two 
points n and r 2 tends to r 2 — r\ . If we take the sphere in space with r = Constant, then the 
three-dimensional space metric (20.52) becomes the metric of a two-dimensional sphere 
of radius r embedded in the Euclidean space, that is, 

d/ 2 = r 2 (d0 z + sin 2 Odip 2 ^ . (20.56) 

The infinitesimal tangential distances are therefore the same as in the Euclidean space 

dZ = r-J dO 2 + sin 2 9dip 2 . (20.57) 

Let us now consider the element of the proper time dr given by (18.5) as follows: 

dr = y/goo df = J 1 — — dr < df. (20.58) 

The proper time between any two events occurring at the same point in space is then 
given by 
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r = 



h~ h- 


(20.59) 


Thus in the curved space-time near the massive sources of the gravitational field there is 
a slowing down of time, compared to the time that would be measured in the pseudo- 
Euclidean limit at the infinite distance from the sources of the gravitational field, that is, 
when r tends to infinity. 

It should be noted that the Schwarzschild metric (20.51) becomes singular at r— r G , 
where goo = 0 andgu = — oo. However, for most of the observable bodies in the universe the 
gravitational radius lies well inside them, where the Schwarzschild metric is not applicable 
anyway. For example for the Sun the gravitational radius is r G = 2.9 km and for the Earth 
the gravitational radius is r G = 0.88 cm. Furthermore, the singularity of the Schwarzschild 
metric can be shown to be more a consequence of the choice of the space-time coordi- 
nates than of the space-time itself. Nevertheless for a few bodies in the universe that are 
actually smaller than their gravitational radius, some physically interesting things do hap- 
pen at the boundary r=r G . For example, matter and energy may fall into the region where 
r < r G but neither matter nor energy (including the light signals) can escape from the re- 
gion where r < r G . Such a region is called a black hole and will be discussed in Chapter 21. 



Applications of the Schwarzschild 
Metric 


In Chapter 20 we have derived the static Schwarzschild solution of the gravitational field 
equations for a static spherically symmetric field produced by a spherical mass M at rest. 
In this chapter we discuss two applications of the Schwarzschild solution to explain two 
physical phenomena which cannot be explained within the framework of the classical 
Newtonian theory of gravitation. 


21.1 The Perihelion Advance 


According to the Newtonian theory of gravitation, the orbit of a planet around the Sun is 
a closed ellipse with the Sun at one of the two foci. Thus the point, which is called the 
Perihelion and where the planet is closest to the Sun, is fixed. However, the experimental 
evidence shows that the perihelion of the planets is not fixed, but gradually rotates around 
the Sun. This rotation, although very slow, is cumulative and can be measured over a long 
period of time. In the classical mechanics the Lagrangian of a planet in the central field of 
the Sun is given by 


L = £ k - ni(p(r) = + 


( 21 . 1 ) 


The conserved energy of the planet is 


3 L mi/ 

E = ir - L = 

dv 01 2 


GM m 


= Constant. 


From (21.1) we may write in the spherical coordinates 


L = m 


^ (r 2 + r 2 ^ 2 + r 2 sin 2 dtp 2 ^ 


GM' 


( 21 . 2 ) 


(21.3) 


where the dots denote the time differentiation of coordinates. Since the classical motion 
of the planet in the central field of the Sun is confined to the plane of the orbit, which we 
take to be the equatorial plane of the spherical coordinates, we have 9 =n/2 and 0 = 0. 
From (21.3) we then obtain 


L 



(f 2 + /V) + 


GM' 

r 


(21.4) 
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The Lagrangian equation with respect to the angular variable tp is given by 

d /9L\ _ 9L 
At \dtp / dtp 

Substituting (21.4) into (21.5) we obtain 

A( mr ^) = 0 , 

or 


(21.5) 


( 21 . 6 ) 


r 2 (j> = h = Constant =>■ r 2 tp = — , 


mr <p 


m 


(21.7) 


where h is the conserved angular momentum of the planet. Substituting (21.7) into (21.2) 
in the polar coordinates, we obtain 


m 

E =1 


t 2 + 


h 2 2 GM' 


From (21.8) we may write 


2 E h 2 2 GM 
r = ^ — 7t + . 


m oi‘ r 


,2 r 2 


Dividing now r z from (21.9) by r <p z from (21.7), we obtain 

1 / dr\ 2 _ m 2 /2U h 2 2 GM\ 


r 4 \ dtp ) h 2 \ m m 2 r 2 


’ 


or 


_Atp \r ) 


2 2 mE /1\ 2 2 GM m 2 (\ 


h 2 


- - + 


h 2 


(O' 


Introducing here a new variable w = 1/r, we obtain 


du\ 2 

d v) 


+ u = 


2m£ 2 G Mm 2 


h 2 


h 2 


-u, 


or 


/ diA 2 mR G 2 M 2 m 4 / GMm 


\dtp J 


h 2 


h 4 


V 1j 2 


Introducing here the following notation: 


e 2 2m£ G 2 M 2 m 4 1 GMm 2 

+ 


where 


e = 1 + 


/i 2 fe 4 

2Eh 2 


h 2 ’ 


G 2 M 2 m 3 ’ 


P = 


ij 2 


GMm 2 ’ 


(21.8) 


(21.9) 


(21.10) 


( 21 . 11 ) 


( 21 . 12 ) 


(21.13) 


(21.14) 


(21.15) 


we may rewrite (2 1 . 13) as follows: 
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/d u \ 2 e 2 


1 

u 

P 


\ di p J 

The well-known solution of the differential equation (21.16) is given by 

1 + e cos <p 


u = 


(21.16) 


(21.17) 


The validity of (21.17) is easily confirmed by direct substitution. Using (21.17) we can 
calculate 


- - u ) = —y cos" <p, 


dw\ 2 e z . 2 

d^j =^ Sm * 


(21.18) 


Substituting (21.18) into (21.16) and using cos 2 ip + sin 2 <p = 1 we obtain the identity. Thus, 
(21.17) is a correct solution of the differential equation (21.16). Using here u = 1/r we 
finally obtain 

r = . (21.19) 

1 + e cos <p 

It is the equation of an ellipse in the polar coordinates. The length of the major axis of this 
ellipse, denoted by 2a, is given by 

p p 2p 


2 a = r{tp = 0) + r((p = n) = 


1 + e + 1 - e 


1 


(21.20) 


(21.21) 


Thus, we may express the parameter p as 

p = a ^ 1 — e 2 ) . 

Thus, according to the Newtonian theory of gravitation the orbit of a planet around the 
Sun is indeed a closed ellipse with the Sun at one of the two foci and the perihelion is fixed. 
Let us now consider the motion of the planet in the Schwarzschild space-time within the 
framework of the general theory of relativity. The equations of motion are the geodesic 
equations (11.25), that is, 

d 2 x" „ dx l dx k 
d.s' 2 lk ds ds 

Using the non-zero Christoffel symbols of the second kind in the Schwarzschild metric 
(20.19) we obtain the four equations of motion 

d 2 f 
ds 2 


+ r tt— — = ° (k,l,n = 0, 1,2,3). 


( 21 . 22 ) 


u "t n df dr 

^ +2r -ra = 0 ’ 


d 2 r 


dfV 


ds / 


dr\ 


d z 9 

ds 2 


dr d6 
12 ds ds 


tt + 2rf? — — + r| 3 


ds J 

tr- 


do 


ds 


^ + c 2 ri n ( " ) + rj, ( “ ) + rj, ( ^ ) + r’ 3 ( ^ = o, 


ds 7 


d2( P , or-3 drd( P , n p 3 dd d( P 


ds 2 


+ 2r i3 T 7T + 2r 23 -T- ~r = °- 
ds ds ds ds 


(21.23) 
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In (21.23) the factor of two appears before each of the off-diagonal terms as they appear 
twice in the sum due to the symmetry of the Christoffel symbols of the second kind with 
respect to its two lower indices. The Schwarzschild metric is a spherically symmetric 
metric and the motion of the planets around the Sun is still confined to the plane of the 
orbit, which we may take to be the equatorial plane of the spherical coordinates. Thus, we 
still have 0 = n/2 and 6 = 0. The third of the equations of motion (21.23) for the angular 
variable 6 becomes therefore trivial and the other three equations are reduced to the 
following equations: 





( 21 . 24 ) 


Substituting the actual values for the Christoffel symbols of the second kind from (20.19) 
in the first and the third of (21.24) we obtain 




( 21 . 25 ) 


or 




( 21 . 26 ) 


Thus, we obtain two constants of motion 



2 E 

1 -| tz = Constant, 

me 


n U(W n 

r — = = Constant. 


( 21 . 27 ) 


ds me 


The form of the constants on the right-hand side of (21.27) is chosen in such a way to 
secure the correct classical analogy. From (21.27) we may write 




( 21 . 28 ) 
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Using now the result for the Schwarzschild metric (20.51) with 6 = tt/2 and 6 = 0, we have 


ds 2 = (l - y) c 2 dt 2 - (l - y) 1 dr 2 - Sdip 2 


or 


-(‘-r m -m) 

Substituting (21.28) into (21.30) we obtain 


(21.29) 


(21.30) 


-("?)"( 
('-?) 


2 E 
me 2 
. 2 




h 2 


or 


re 


2 E 


1- — = 1 + 

r me * 




Using (21.27) we may calculate 

dr 

r 2 di p ' ds 

Substituting (21.33) into (21.32) we obtain 


_ = J_ dr ,d<p 
ds 1-2 ' 


I; d /I 
me di p 


h 2 


d 

di p 


-,2 


h 2 


+ — + 


fc 2 


re 


(21.31) 


(21.32) 


(21.33) 


(21.34) 


Using here the result (20.50) for re and introducing a new variable u = 1/r, we obtain 


li 2 


/ d(A 2 ? 

W +W 


2E 2 CM h z r G , 

T + O W + Q — o W 


rac^ 


or 


dw\ 2 

d^J 


+ ir = 


2m£ 2 GMm 2 


h 2 


h 2 


-u+ r G u . 


Using again the result (20.50) for r G , we finally obtain 


/ dw\ 2 

W 


+ u = 


2 mE 2 GMm 2 2 GM 


h 2 


h 2 


- u + 


u\ 


(21.35) 


(21.36) 


(21.37) 


Comparing (21.37) with the Newtonian equation (21.12) we see that the only difference is 
an additional non-linear term proportional to u 3 which vanishes for c —> oo. This term can 
therefore be considered as a small perturbation of the Newtonian equation (21.12) which 
introduces the general-relativistic corrections to the Newtonian results. Differentiating 
both sides of (21.37) with respect to the angular variable <p, we obtain 


du ( d 2 u \ d u ( 

I -7 + u ) =2 —— ( 


dip \d<p 2 


du / GM m 2 3 GM 


dip \ h 2 


(21.38) 
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If we disregard the solution du/dip = 0, which represents the circular orbit with 
r = Constant, we obtain the non-linear differential equation for u in the form 


d 2 i 

dtp‘ 


+ u = 


GMm 2 3 GM 


h 2 


u 2 . 


The solution to the corresponding Newtonian linear equation 


d 2 u (0) 

dtp 2 


+ u m = 


GM m 2 
h 2 ’ 


is given by (21.17), that is, 


M (0) _ 1 + g COS <p 


(21.39) 


(21.40) 


(21.41) 


Using the perturbation method, we may construct an approximate solution of the non- 
linear equation (21.39) in the form 


w = u (0) + u (1) , 


« u< 0) . 


Using (21.42) with (21.41) we may write 


d 2 u d 2 w (0) d 2 w (1) d 2 w (1) 


and 


dip 2 dtp 2 


u = u (0> + iC' = u yL ’ -\ — cos® -| — . 

P P 


dtp 2 


dip 2 


,d) 


v(D 


— COS ip, 

p 


1 


(21.42) 


(21.43) 


(21.44) 


Combining the results (21.43) and (21.44) and using the definition (21.14) we obtain 

GM m 2 


d 2 u _ d 2 u m 

dip 2 dip 2 


+ u m + 


h 2 


(21.45) 


Substituting the result (21.45) into (21.39) and putting u = u (()> in the non-linear term, we 
obtain the equation for the perturbation u <]) in the form 

d 2 u {1) m 3 GM 


dip 2 


+ u a) = 


(u (0> ) . 


Substituting the result (21.41) into (21.46) we obtain 


d 2 u a) 

dip 2 


+ U 0) = 


3 GM 
c 2 p 2 


(1 + e cos ip ) 2 , 


or 


d 2 u a) n> 3 GM 6 GM 3 GM 7 , 

+ u =T2Tr + 72Tr ecos ^ + T2Tr^ cos < p - 


dtp 2 




p 


c*- p ^ 


(21.46) 


(21.47) 


(21.48) 


Considering orbits with small eccentricity e the term of the order e 2 can be neglected. The 
contribution from the constant term is negligible as well. The only term that produces an 
observable effect is the one proportional to cosy with the contribution which increases 
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continuously after each revolution. Dropping the constant term and the term proportional 
to e 2 , we obtain 

d 2 u m 


dtp 


m 6 GM 
+ u' ’ = „ „ ecos tp. 

Z q<L 


The solution of the differential equation (21.49) is given by 


,(i> _ 


3 GM 


u = _ „ etpsmtp, 


(21.49) 


(21.50) 


which is easily shown by direct substitution into (21.49). Substituting (21.41) and (21.50) 
into (21.42), we obtain 

1 + e cos tp 3 GM e 


u = 


H t. tp smcp. 

p p 


(21.51) 


(21.52) 


Let us now introduce the increment A tp as follows: 

3 GM 3 GM 

where we have used the result (21.21) for the parameter p. Thus, we may rewrite (21.51) as 
follows: 

1 + e cos <p + e A<p sin tp 


u = 


(21.53) 


Using now the trigonometric formula 

cos {up — Ai p) = cos (p cos \(p + sin ip sin Ai p, (21.54) 

and the approximations for the trigonometric functions of a small angle A <p, given by 


cosAip^l, sinAip^Aip, 

(21.55) 

we obtain 


cos tp + A<p sin <p « cos (tp — A <p) . 

(21.56) 

Substituting (21.56) into (21.53) , we finally obtain 


1 + e cos (tp — At p) 

(21.57) 


From (21.57) we see that while a planet moves through an angle <p, the perihelion advances 
for a fraction of the revolution angle equal to 

Ai p 3 GM 


i p c 2 a (l — e 2 ) 

For a complete revolution [<p = 2: r) the perihelion advances for an angle 

6? tGM 


Ai p = 


° 2 a (1 - e2 Y 


(21.58) 


(21.59) 
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For the planet of Mercury the theoretical result for the relativistic perihelion shift is equal 
to A (p = 43.03 seconds of arc per century, while the observed perihelion shift is equal to 
A cp = 43.11 ± 0.45 seconds of arc per century. The perihelion shift of the planet Mercury 
could not be explained in the Newtonian theory of gravitation. This remarkable agreement 
of the theoretical result from the general theory of relativity with the observational data 
was the first major experimental confirmation of the theory. 


21 .2 The Black Holes 


In Chapter 20 we mentioned the singularity of the Schwarzschild metric (19.51) at the 
gravitational radius of the spherical body r=r G and the possibility of the existence of 
bodies in the universe which are actually smaller than their gravitational radius r G . The 
region where r <r G around such bodies is called the black hole. A number of physically 
interesting phenomena occurs at the boundary r — r G . For example, we have argued that 
matter and energy may fall into the region r <r G of a black hole, but neither matter nor 
energy (including the light signals) can escape from that region. In order to prove this 
assertion, let us consider a particle falling radially into a black hole with a radial velocity 
u 1 = dr Ids. As the particle is falling radially, we have u 2 = u 3 = 0. The motion of the particle 
is described by the geodesic equations (11.25), that is, 


d u n 
ds 


+ T'l k u l u k = 0 . 


(21.60) 


Using u 2 = u 3 = 0 and the results for the Christoff el symbols of the second kind (20.19), 
we may write the temporal equation of motion as follows: 

di«° 


or 


ds 

du° 


= —T° k u l u k = — 2Fj 0 u l u° , 


dv dr n di> n 
= -2— — u° = —2—ir, 


(21.61) 


(21.62) 


ds dr ds ds 

where the factor of two appears because the term proportional to T° 0 = T^ appear twice 
in the sum. Thus, we obtain 

du° 


ds + u ° = ex P( — 2v) ^exp(2v)i< 0 ^ = 0. 
Equation (21.63) can be integrated to give 

exp(2v)u° = goou 0 = K = Constant, 


(21.63) 


(21.64) 


where K is the integration constant which is equal to the value of goo at the point where 
the particle starts to fall towards the black hole. Using the identity g^ n ii k u n = 1, we may 
also write 

1 = gknU k U n = good* 0 ) 2 +gll((d) 2 . 


(21.65) 
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Multiplying by goo and using (21.64) as well as googn = — 1, we obtain 

goo = (goo) 2 (w 0 ) 2 + goognCw 1 ) 2 = K 2 - ( u 1 ) 2 , 
or 


re 


(W ) =K‘ L - gOQ=K - \+ -y. 

For the radially falling body we have u l < 0, and we may write 

Using (21.64) and (21.68) we may calculate the ratio df/dr as follows: 

d t u° K /„, _ r G N-V 2 


dr c u l 


egoo 


~ 1 + f) 


or 




(21.66) 


( 21 . 67 ) 


(21.68) 


( 21 . 69 ) 


( 21 . 70 ) 


If we now assume that the particle falling radially into a black hole is close to the 
gravitational radius r G , then we may write r = r G + e with e <<c r G . Equation (21.70) then 
becomes 

- 1/2 


dr 
dr 

Using now the approximation 


in (21.71), we obtain 


or 


Integrating (21.74) we obtain 


K 

( e V 

-1 

2 ( e \ 1 

— 

1-1 + — ) 

X 

K 2 - 1 + 1 + — 

c 

V r G ) 


V r G J 


1 + — 
re 


-l 


1 - — , 
re 


d t ^ 

K r G 

K 2 6 

■ 1/2 . r 6 

dr 

ce 

r G_ 

ce 


d t ^ 

r G 



dr 

c (r - 

re) 


-r G In (r - r G ) + Constant. 


( 21 . 71 ) 


( 21 . 72 ) 


( 21 . 73 ) 


( 21 . 74 ) 


( 21 . 75 ) 


From (21.75) we note that as r r G we have t —>■ oo. Let us now consider an observer 
traveling with the particle. The proper time measured by the observer in its own rest frame 
is given by (18.5), that is, 


dr = -ds = ^goodr. 


( 21 . 76 ) 
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Thus, we may write 



( 21 . 77 ) 


or 



( 21 . 78 ) 


If we again assume that the observer traveling radially into a black hole is very close to 
the gravitational radius r G , then we have r=r G + e with dr = de. Equation (21.78) then 
becomes 



( 21 . 79 ) 


or 



( 21 . 80 ) 


Integrating from the starting point e to the point e = 0 where the observer reaches the 
gravitational radius r G , we obtain 





( 21 . 81 ) 


or 



( 21 . 82 ) 


r 


From the result (21.82) we conclude that the observer reaches the point r— r G after the 
lapse of a finite proper time according to his own clock. Thus, the singularity at r — r G is not 
a real unphysical singularity, but merely a coordinate singularity which is a consequence 
of the choice of the coordinate system. Let us now assume that, during the radial fall, the 
observer is sending light signals to a distant counterpart at precisely regular time intervals 
according to his own clock. Using the definition of the differential of the proper time (18.5), 
the differential of the coordinate time is given by 



( 21 . 83 ) 


and from the point of view of the distant receiver the light signals are red-shifted by a factor 
proportional to 


- 1/2 


- 1/2 


Chapter 21 • Applications of the Schwarzschild Metric 193 


Thus, although the traveling observer passes the point r=r G after the lapse of a finite 
proper time, from the point of view of the distant receiver the time intervals between light 
signals become longer and longer as the traveling observer approaches the point r— r G . 
The distant receiver never sees the traveling observer after his passing below the point 
r=r G , which is in line with the initial assertion that neither matter nor energy (including 
the light signals) can escape from the region with r < r G . 

The lack of communication with the outside world is a basic property of black holes. 
The boundary r = r G is called an event horizon. Just as the curved earth creates a horizon 
limiting the range of vision of an ocean navigator, the strongly curved geometry in the 
vicinity of a black hole creates an event horizon hiding the space-time of the interior of 
the region with r < r G from the external observer. Nevertheless, the black holes do exert 
influence on the external observers by their gravitational effects on external bodies arising 
from the Schwarzschild metric for r > r G . 

Another interesting feature of the black holes is that inside the region r < r G there 
can be no static bodies. Only the dynamic bodies can exist inside the region r < r G . This 
property is obvious from the unphysical negative result for the square of the world line 
element of a body at rest, for which we have dr = d6 = dip — 0, that is, 

ds 2 = c 2 dr 2 = c 2 ^1 — — ^ df 2 < 0. (21.85) 

At this point we want to examine how bodies move inside a black hole. In particular it is of 
interest to find out if they are moving towards or away from the central mass. We first note 
that by virtue of the principle of invariance of the speed of light, the world line element ds 
of a light ray is always equal to zero. The equation ds = 0 defines the two light cones. As an 
illustration consider a pseudo-Euclidean metric described by the Descartes coordinates, 
where dy = dz = 0. The light cones are then defined by 


ds 2 = c 2 df 2 — dx 2 = 0, 

(21.86) 

Xq- Xp = ±C(fg - tp). 

(21.87) 


Equation (21.87) is the equation of the cross section of two cones with the y-plane or 
the z-plane. All world lines of a light ray in the four-dimensional space-time, passing 
through an arbitrary world point P, must lie on these two light cones where ds = 0. All 
other permissible world lines of arbitrary bodies with ds 2 > 0 must lie within the two light 
cones, since otherwise the slope of the actual world line would be larger than c indicating 
a body moving faster than the speed of light with ds 2 < 0. All world lines within one of the 
two cones point into the future compared to the world point P, while all world lines in the 
other of the two cones point into the past compared to the world point P. 

As we have argued before, if we want to explain how bodies are moving inside a black 
hole, the usual set of four coordinates (i, r,0,<p) is not adequate and we need to introduce 
a more suitable new set of coordinates. The simplest way to achieve this goal is to keep the 
three spatial coordinates and just introduce a new time coordinate w, defined by 
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r G , 

w = t H In 


r 

1 

re. 


( 21 . 88 ) 


From (21.88) we obtain 

dr = d iv — — ( 1 — — ) dr. (21.89) 

cr \ r / 

The square of the original coordinate time differential dt as a function of the new 
coordinate time differential dw defined by (21.89) is then given by 

dr 2 = die 2 — f i _ — i dwell- 
er \ r / 


+ Al 1- ?) ^ (2L90) 

On the other hand for the light cones of a radial motion, where we have d6 = dip = 0, the 
result for the metric gives 

ds 2 = (l - y) c 2 dt 2 - (l - — ) 1 dr 2 = 0, (21.91) 

or 

dr 2 = (l - y) 2 dr 2 . (21.92) 


Substituting (21.92) into (21.90), we can remove the dependency on dt 2 and obtain a 
quadratic equation in dw/dr as follows: 




= 0 . 


The two solutions of the quadratic equation (21.93) are given by 


dw 

dr 




or 


dw = _l dw = 1 / + rq\ / _ rq\-i 
dr c’ dr c \ r ) V r / 


(21.93) 


(21.94) 


(21.95) 


The first of (21.95) shows that outside the black hole for r> r G some world lines have a 
decreasing coordinate distance r with increasing coordinate time w, that is, the light rays 
move towards the central mass. The second of (21.95) for r> r G shows that some world 
lines have increasing r with increasing coordinate time w, that is, the light rays move away 
from the central mass. On the other hand inside the black hole for r <r G both equations 
(21.95) show that ah world lines have decreasing coordinate distance r with increasing 
coordinate time w, that is, the light rays always move towards the central mass. Based on 
the earlier conclusion that ah other permissible world lines of arbitrary bodies with ds 2 > 0 
must he within the two light cones, we see that ah matter and energy (including the light 
signals) within a black hole for r <r G can only move towards the central mass and can 
never escape from the region where r < r G . 
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The above discussion of the properties of black holes would be only academic unless 
there were reasons to believe that such objects exist in the universe. It is today generally 
believed that the black holes do exist and that they are created by gravitational collapse in 
the final stage of the evolution of massive stars with a mass larger than 10 times the solar 
mass. This type of black holes is expected to have a mass in a range of 2-3 solar masses. 
Some stellar objects that may be candidates for this type of black holes have already been 
studied. The supermassive black holes comprising thousands, millions or billions of solar 
masses may also exist. It has also been suggested that in the early stages of the creation 
of the universe, due to a high density of the hot matter, some small material objects could 
have been squeezed sufficiently to form the so called mini-black holes. 




Gravitational Waves 



22.1 Introduction 

The relativistic theory of gravitational fields includes the possibility of propagation of 
gravitational waves with a finite speed, the speed of light (13.1), in source-free space, 
ft turns out that the linearized gravitational field equations (19.96) allow for wave-like 
solutions that represent the propagating gravitational waves. 


22.2 Linearized Einstein Field Equations 

Let us first consider the non-linear held equations ( 19 . 96 ) in the form 

Rkn-\gknR=-^^T kn (k,n = 0,1, 2, 3). (22.1) 

Equation ( 22 . 1 ) determines future values of the metric tensor g mn from given initial 
values. Relativistic theory of gravitational fields is invariant under any possible coordinate 
transformations 

x m z m ( x k } {ki n = 0; 1; 2 , 3 ), ( 22 . 2 ) 

where z m = z m (x k ) are invertible and differentiable functions with differentiable inverses. 
Under the transformations ( 22 . 2 ), the metric tensor g m n(x k ) transforms as follows: 

gmn(X k )^ g mn (Z k )= —— gpq(X k ). ( 22 . 3 ) 

In order to linearize ( 22 . 1 ), we first assume that there exists a reference frame in which, in 
a sufficiently large space-time region, we can write 

gmn(X k ) = limn + hmn(X k ), \h mn \ << 1 , ( 22 . 4 ) 


where rj mn is the metric tensor of the pseudo-Euclidean space-time with components 
displayed in the matrices (13.9) and (13.12), that is, 


[r)mn\ = b mn ] = 


10 0 0 
0-100 
0 0-10 
0 0 0 -1 


( 22 . 5 ) 


In the linear theory based on the approximation (22.4), we assume that the indices of all 
tensors are lowered and raised using i] mn and if" 1 , respectively. The correction arising 
from the presence of h mn and h mn , respectively, can be neglected as being of higher order 
in the perturbation tensor h mn . Thus, we can calculate 
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g mn = r, mk gkp ri pn = rj mk (ri kp + h kp )r,P n 


= r, mk + r, mk h kp r,P n 


= q mn + h mn . 


( 22 . 6 ) 


The components of the covariant perturbation tensor h km are as of yet unknown func- 
tions of the space-time coordinates, that can be displayed as elements of the following 
matrix: 


[hmn\ = [. l mn ] = 


hoo ho\ ho 2 ho3 
hio h u h\2 h\3 

Il20 Il2l h22 h23 

h-30 h 31 h,32 h33 


(22.7) 


The components of the corresponding contravariant tensor h km can be calculated using 
the following matrix equation: 

[h mn ] = [n mk ] [h kp ] [, 7 P" ] 


1 

0 

0 

0 ' 


hoo 

hoi 

ho2 

ho3 


" 1 

0 

0 

0 ' 

0 

-1 

0 

0 


hio 

/in 

hi2 

hi 3 


0 

-1 

0 

0 

0 

0 

-1 

0 


ll20 

h 2 i 

h 22 

1*23 


0 

0 

-1 

0 

0 

0 

0 

-1 


_ 1*30 

1*31 

h 32 

1*33 _ 


0 

0 

0 

-1 


hoo 

-hoi 

~ho2 

—ho3 

h\o 

hi i 

hi2 

hi3 

h20 

h2i 

h 22 

h23 

h30 

h 3 i 

h32 

h33 


where we note that the order of the tensors in (22.6) has been chosen to reflect the correct 
order of matrices in the corresponding matrix product (22.8). While the order of matrices 
in a matrix product is essential for the correct result, the order of tensors in a tensor 
product can be chosen arbitrarily, as long as all indices are correct. 

In the linear approximation (22.4), the coordinate transformations (22.3) become the 
pseudo-Euclidean Lorentz transformations (13.29), and we can rewritte (22.3) as follows: 

gmn(X k ) g' mn (Z k ) = Af„ gpq(X k ) A„, (22.9) 


or 


Vmn V ) — A m l] P q A n + A m h p q(X ) A n 

= tan + h p q(X k ) A^, 

such that the perturbation tensor h mn transforms as 

Km ( z ^ = A m Kq(X k ) A^. 


(22.10) 


(22.11) 


In (22.10) we used the result 

A m Vpq A n — 7 1mn j^A^j [ ! ?pq] = [ten] > (22.12) 

which is easily verified using (13.29) for l\ p m and as well as (13.9) for rj p q, and performing 

the matrix multiplications in given order. 
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As a next step, we now use the definition (9.28) with (22.4) to calculate the Christoffel 
symbols of the first kind 


^ \dp8mn + dn&pm dmgnp) 

= 2 ( dphmn + 4,7 hp m — d m hnp) , 


(22.13) 


where we note that t) mn are components of a constant tensor that do not contribute to the 
result (22.13). Christoffel symbols of the second kind are then obtained by means of the 
definition (9.29), as follows: 


p rn 

1 np 


= g mk r k ,np = \ (r) mk + h mk ) (d p h kn + 3 nh pk - d k gnp) 


— ~ r \ m (dphmn H~ dnhpm ~ dk^np) + 0[(h mn ) 2 ] f 


(22.14) 


where 0[(h mn ) 2 ] denotes the terms of at least second order in the small perturbation 
\h mn \ << 1, that can be neglected in the present context. Using further the definition 
(12.20), we can calculate the linearized curvature tensor 


T)k O pfc o pAl _| _ pZ pAr pZ pZr 

ri mnp ° nL mp °P l mn ' 1 mp l In 1 mn l lp> 


(22.15) 


where we note that the products of Christoffel symbols (22.14) are always of order 
0[(h mn ) 2 ] and can be neglected, such that 


Rmnp ~ dn^mp — dpT mn + 0[(h mn ) 2 \ ~ d n T mp — d pf n 

The Ricci tensor (12.44) then becomes 

Rmn — $mnp = ^n^mp ^p^mn T 0[(h m n ) L 
or, neglecting the higher order perturbations, 

Rmn — - ^ ^ n (dphkm T dm^kp ^k^tnp) 

— 2^ {^ n ^km T ^rn^nk ^k^mn) 

— -(dndphm + ^n^m^P^^pk ~ ^n^P^mp 
—dpd n h^n ~ dpdmhn T ^P^^p^k^mn) 


(22.16) 


(22.17) 


or 


Rmn — — ~ (^dpdnhm T ^p^m^n ~ dn^m h — , 

where we introduced the notation 

□ = if k d p d k = t] pk d p d k = Jr ^ - V 2 , 

h = if k h pk = ii pk h pk . 

Contracting again with rj mn , we obtain the Ricci scalar (12.50), as follows: 


(22.18) 

(22.19) 

(22.20) 

( 22 . 21 ) 
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R = r, mU Rmn 

= -\ (r, mn dpd n h?„ + v mn dpd m < - r, mn d m d n h - ar, mn h mn ) ( 22 . 22 ) 

Interchanging the dummy indices m ^ n in the first term in the bracket on the right-hand 
side of (22.22) and using symmetry of the pseudo-Euclidean metric tensor (j i mn = if im ), 
we obtain 

R=~ \ ( 2 ii mn d p d m K - 2 Dll) = -3 p d k hP k + Uh. ( 22 . 23 ) 

Substituting the results (22.18) and (22.23) into (22.1) gives the linearized Einstein field 
equations in the form 

Rjnn — 2 SmnR = ~ ^3p^n^m T 3p3,n^n — V 9„ ; 1 1 

— Dhmn ~ limn dpdk hR k + limn Oil) — . Inin- ( 22 . 24 ) 

Equations (22.24) are the equations for the potentials of the gravitational field, since the 
components of the metric tensor g mn define the potentials of the gravitational field, as 
indicated in (18.38). 

As in the case of the electromagnetic fields, the components of the gravitational 
potentials h mn are not uniquely defined. The uniquely defined gravitational field forces, 
being the observable physical quantities described by means of the space-time curvature 
tensor (22 .16), are therefore invariant with respect to the following gauge transformations: 

hmn h mn = hmn ~ dmXn ~ dnXm ■ ( 22 . 25 ) 


The invariance of the space-time curvature tensor (22.16), with respect to the gauge 
transformations (22.25), is easily proven by substituting (22.25) into the formula for 
Christoffel symbols of the first kind (22.13), whereby we obtain 

T m,np = 2 phmn T 3 nhpm ~ dmhnp) 

= 2 [3 p(hmn ~ 3 'mXn ~ dnXm) T 3 n(hpm ~ dpXm ~ &mXp) 

— i)m ( h n p — 3 nXp ~ 3p/n)] = ~ {pphmn + 9 nhpm ~ 9 mhnp ) 

1 

+ —(—dpdmXn ~ dpdnXm ~ dndpXm ~ dndmXp 

+ dmdnXp + 9;??9pX/d — T m,np ~ 2dpd n Xm- (22.26) 


By definition (22.14), we then obtain the Christoffel symbols of the second kind 


p/c fjkm p p/c no a 

1 np ~ 6 1 m,np — 1 np ^ °p°nA 


(22.27) 


Substituting (22.27) into (22.16), we obtain 


R 


>k o r~ik o p 

l mnp u n l mp U P L mn 


k = dn(^ lp - 2dpd m X k ) 


— dpi^mn ~~ 23„3 mX k ) — dn^mp — 3 p^mn ~ Rmnp’ 


(22.28) 
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and the result (22.28) shows that the components of the space-time curvature are indeed 
invariant with respect to the gauge transformations (22.25). As in the case of the electro- 
magnetic fields, we can now choose a specific gauge condition for linearized gravitational 
fields. It is common to adopt the harmonic gauge condition, where we use the result (9.43) 
to require that 


8 kn = g kn g mp ( dng kp - \dpgkn) = 0. 

In the linearized case, the gauge condition (22.29) becomes 


(22.29) 


r) kn ri mp (dnhtp - \d p h kn ) = r, m P 


1 


dnhp — —3 pi^'dlk-p) 


= n mp (d„h n p - ha p hj = o, 


or 


dnhp — —dph. 

Let us now recall the linearized Einstein field equations (22.24), in the form 

3/j dj,h P n + d m d p h ^ — d n d m Il — I I ll mn — dpd^hj 

16jtG 

+ VmnCih = — ^ 'nin ■ 

Substituting the gauge condition (22.31) into (22.32), we obtain 

2 d n dmh + — 3 mdnh — d n dmh — □ hmn ~ T^dmn^l^dp'djh 

n , _ 16 ' tG t 
3 " VmnCrl — . t mn . 


(22.30) 


(22.31) 


(22.32) 


(22.33) 


By inspection of (22.33) we readily see that the first three terms on the left-hand side cancel 
each other, while the fifth term is equal to minus a half of the sixth term. Thus, (22.33) can 
be significantly simplified to read 


or 


, 1 16jtG 

-Oh mn -\- -ljmnOh = — — T mn , 


nil 1 16 *G_ 

U ( 'him — —r]mnh J — j tmn- 


(22.34) 


(22.35) 


The linearized gravitational field equations (22.35) can be further simplified if we intro- 
duce the so-called trace-reverse tensor as follows: 


such that 


hmn ~ hmn ~ 2 Pmnh ’ 


U — n mn h — n mn h --n mn n h — h—-ih — —h 
n — p Hmn — ' 1 "inn — — >1 nmn't — a— — 'iff — 'h 


(22.36) 


(22.37) 
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where we used the property the diagonal tensors (22.5), 

r] mn rimn = 00 + r) U rn i + >? 22 7)22 + >? 33 »733 = 4. (22.38) 

Using the definition (22.36), (22.35) become 

n/i,,,,, = 7 T mn . (22.39) 

c 4 

Equations (22.39) are linear wave equations for the potentials of the weak gravitational 
fields. They allow for the solutions that represent the propagating gravitational waves. 


22.3 Plane-wave Solutions 


In vacuum, far away from the field sources (where T mn = 0), the linearized gravitational 
field equations (22.39) become 





hmn — 0 . 


(22.40) 


Equations (22.40) have the plane-wave solutions of the form 

hmn = 4 m „ e 'V = A mn e i(Mt - lf \ (22.41) 

where A mn is a constant symmetric covariant tensor and 

k q = (pfc) => k q = i)qpk p = {p -k], (22.42) 

is a constant four-vector, known as the wave vector. Substituting the solutions (22.41) into 
the wave equations (22.40), we obtain 


d 2 h mn 

dxPdx p 


= Amnr,PS dSdtf (' ^ *’) = A mne ik ^v ps (ik P W s ) 

— hmnkpk^ = 0 , 


(22.43) 


which leads to the following condition for the components of the wave vector k p , 

2 

k„k p = - |fc| 2 = 0 =Hfc| = (22.44) 

p c 2 c 

The condition (22.44) is analogous to the corresponding condition for the electromagnetic 
plane waves (e.g., light), and it implies that k p is a null four-vector as in the case of 
light rays. Thus, the gravitational plane waves propagate with the speed of light (13.1) in 
vacuum. The constant covariant tensor A mn defines the polarizations of the gravitational 
plane waves. In order to specify the components of the tensor A mn , let us first recall the 
harmonic gauge condition for the linearized gravitational fields (22.31), in the form 
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where we note that the mixed metric tensor of the pseudo-Euclidean space is equal to the 
unit <5 -tensor, 

g' p n = 8r,g qm = = S™, ( 22 . 46 ) 

such that the mixed trace-reverse tensor (22.36) is defined by 

h’? = h™ - ( 22 . 47 ) 

in agreement with (22.45). Substituting the solution (22.41) into the condition (22.45), we 
obtain 

dmh’p = rjp n d m h mn = i(k m A mn )r, pn e ik 1* = 0 . ( 22 . 48 ) 

Thus, the harmonic gauge condition (22.45) requires that A mn is orthogonal (transverse) 
to the direction of wave vector k m , that is 

k m A mn = 0 , (n = 0 , 1 , 2 , 3 ). ( 22 . 49 ) 

In general, an arbitrary rank-2 tensor A mn in the four-dimensional pseudo-Euclidean 
space-time has 16 independent components. The symmetry of the tensor A mn , that is, 
the condition A mn =A nm , reduces the number of the independent components of A mn 
to ten. Each of the six components of the matrix [A mn ] below the main diagonal are 
equal to a corresponding component above the main diagonal. The four gauge conditions 
(22.49) reduce the number of independent components of the tensor A mn to only six. As a 
final step, we can now recall the contravariant form of the gauge transformations (22.25), 
that is, 

h mn h mn = h mn - d m 6 n - d n e m , ( 22 . 50 ) 

such that 

d m h mn = a m h mn - ue n - d n (d m e m ) = o. (22.51) 

Thus, if we can find a four-vector 6 n (x p ) such that DO" and 9 m 6 m vanish, then the gauge 
transformation (22.50) preserves the harmonic boundary condition 

d m h mn = d m h mn = 0 . ( 22 . 52 ) 

A suitable four-vector 9 n (x p ) can be written in the form 

0 m - iC m e ikqXq , ( 22 . 53 ) 

where C m is a constant transverse four-vector perpendicular to the wave vector kP. Using 
now the condition (22.44), we readily obtain 

U9 m = -(k p k p )9 m = 0 , ( 22 . 54 ) 


= -(k m C m )e ilCqXq = 0, 


and, since k m C m = 0, we also have 

d m e m 


(22.55) 
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such that the condition (22.52) is indeed satisfied. Using further (22.50), with (22.41) and 
(22.53), we can write 

h mn = (Amn + kmCn + k n Cm)e ilc *** = A mn e ik ^ . (22.56) 

Thus, we conclude that with a suitable choice of the constant transverse four-vector C m , 
we can impose four more conditions to the components of the tensor A mn . This reduces 
the number of independent components of A mn to just two, and these two independent 
components of A mn represent the two polarizations of the gravitational waves. For exam- 
ple, by choosing 

A m o = Aom = A m o + k m Co + koCm — 0, (22.57) 


we can fix that all temporal components of the tensor A mn vanish. From (22.57) with 
m = 0, we readily obtain 

Aoo = Aoo + 2fcoCo = 0 =>■ Co = — — (22.58) 

2fc 0 


while for m = a (a = 1, 2, 3), we obtain 

AaO = Aoa = A*o + k a Co + koC a = 0, (22.59) 


or 


C a 


Ao a k a ^ Ao a ( k a A 

~1 *T " k~o C ° = 2(fep2 00 ' 


For the four- vector C m given by (22.58) and (22.60), we can calculate 


k m Cm = k°C 0 + fc“C a 


Aoo kfAoa Wka 
■ 2 k 0 + 2(k 0 )2 Ao ° 


Using here k?k a = —k^ by (22.44), we obtain 

k m C m = —Aoo j - — = — (k°Aoo + kf* Aoa) = 0. 

k 0 k 0 \ ) 


(22.60) 


(22.61) 


(22.62) 


where we used (22.49) with n = 0, to confirm that the above choice of the four-vector C m 
is indeed perpendicular to the wave vector k m . Thus, we see that we can always put all 
temporal components of the tensor A mn to zero. This leaves us with nine space-oriented 
components for m, n = 1,2,3. In order to specify these components, without loss of 
generality, we can choose the direction of the wave propagation to be along the z-axis 
[m = 3). Such a choice gives the following wave vector: 

k p = {-, 0, 0, 7c), k=~, (22.63) 

c c 

whereby we can again use (22.49) to write 

k m A mn = k 0 A 0n + k 3 A 30 + k 3 A 3a = kA 3a = 0, (22.64) 


which implies that the longitudinal components ( m , n — 3) of the tensor A mn are all zero. 
Since now both temporal and longitudinal components of A mn can be put to zero, the 
symmetric tensor A mn itself can be displayed as the following matrix: 
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[A mn ] = [ A mn ] = 


0 0 0 

0 A xx A xy 

0 A X y Ayy 

0 0 0 


0 

0 

0 

0 


( 22 . 65 ) 


As a final step, we adopt the so-called transverse traceless (TT) gauge where we require that 


h = r, mn A mn = - (A xx + Ayy) = 0 , ( 22 . 66 ) 

which specihes the final condition on one of the three remaining independent compo- 
nents of A mn , as specified in (22.65). Thus, we have Ayy = —A xx , and 


[A mn ] = [A mn ] = 


0 0 

0 A xx 
0 A xy 

0 0 


0 0 

A x y 0 
~A XX 0 

0 0 


( 22 . 67 ) 


fn the traceless gauge we have h = h = 0 and there is no need distinguish between the 
traceless tensors h mn and h mn . Thus, the plane-wave solution has the form 


[hmn ] 


0 0 0 0 
0 h + h x 0 
0 h x —h + 0 
0 0 0 0 


gi(cot-kz) 


(22.68) 


where we introduced the notation h + = A xx and h x = A xy for the amplitudes of the two 
polarization states of gravitational waves in z-direction. These two polarization states can 
be describes by means of two polarization tensors 


0 0 0 O' 
0 0 10 
0 10 0 
0 0 0 0_ 

such that the plane-wave solution can finally be written as follows: 

— ( [i , _j_ p x ) kz) 

'him \ ,l + b mn ' l 'x b mn) *- 


\ s mn\ 


0 0 0 0 

0 1 0 0 

0 0-10 
0 0 0 0 


( 22 . 69 ) 


( 22 . 70 ) 


22.4 Effects of Gravitational Waves on Particles 

The detection of gravitational waves is based on the possibilities to measure the tiny 
relative displacements of test particles due to the interactions with gravitational radiation, 
ft is therefore important to investigate the effects of gravitational waves on point particles. 
We therefore consider a single test particle at rest for s = 0, and use the geodesic equa- 
tions (11.25), 

d 2 x n „ dV dx 1 

i_ r” = a 

ds 2 i l ds ds 


( 22 . 71 ) 
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At s = 0, the particle is at rest, such that 


( dx m \ [ dx° I 

(-s-L- 


and (22.71) becomes 


/ d 2 x n \ 

V ds 2 ) s= o “ 


, „ cLd dx' 1 „ 

^ ds ds I -- cr «o- 


(22.72) 


(22.73) 


s=0 


On the other hand, from (22.14), we obtain 


r 00 = 2 ^ ( 9 °^‘7° + 9 oho q — dqhoo) = 0. 


(22.74) 


In TT-gauge, T", is identically equal to zero, as we have put all temporal components of 
h mn equal to zero. Thus, in TT-gauge, we have 



d 2 x“ \ 
ds 2 j s=0 


= 0, 


(22.75) 


which implies that a particle, at rest before the arrival of a gravitational wave, remains at 
rest. Thus, it is not possible to detect any effect of gravitational waves on a single particle. 
In other words, the coordinate position of a point particle, initially at rest, does not change. 
What may change is the relative proper distance between two particles, and the effects of 
gravitational radiation can be detected by measuring the proper distance between two 
particles. In order to illustrate this point, we can use (22.68) to write down the linearized 
metric tensor in the form 


fen] — [>lmn] + [hmn] 

'1 0 0 

0-1 0 
“ 00-1 
_ 0 0 0 - 

Thus, the space-time metric becomes 

ds 2 = g mn dx m dx n — c^dt 2 — [1 — h+ cos (cut — kz)]dx 2 

+ 2h x cos (ait — kz)dxdy — [1 + h + cos(a> t — fcz)]dy 2 — d 2 
= c^df 2 - dZ 2 . 



' 0 

0 

0 

0 ' 

+ 

0 

0 

h+ 

h x 

h x 

-h+ 

0 

0 


0 

0 

0 

0 


cos(a it — kz). 


(22.76) 


(22.77) 


Consider now two particles that are initially (for s = 0) at rest along the x-axis, with L being 
their initial unperturbed relative proper distance. In such a case y = z — 0 and the spatial 
distance element along the x-axis, denoted by dl x , is obtained from (22.77) as 

dZ* = (l — h + cos cut) dx xs ^1 — cos cotj dx, (22.78) 

where we used the approximation *J1 — a ~ I — a/2, valid for small values of a. Integrating 
(22.78), we obtain 

l x (L) m l( 1 — cos wt 


(22.79) 
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In an analogous way, for two particles initially separated by a proper distance L along the 
y-axis, we obtain 


ly(L) 


1 + — COS Cot 


(22.80) 


Thus, when the separation along the x-axis is compressed, the separation along the y-axis 
is elongated and vice versa. But there is no change in the longitudinal separation, that is 
the separation along the z-axis, where we readily obtain l z (L) = L. This can be explained by 
the fact that, similarly to the electromagnetic radiation, the gravitational radiation consists 
of transverse waves. 

As a next step, we consider the effects of the gravitational radiation on two particles 
separated by a relative proper distance L along the lines y = ±x (dy = ±dx), such that 

Ax m =L jo, -=,±4=,ol . (22.81) 

I V2 V2 I 

The square of the spatial distance elements along the lines y = ±x, denoted by dt±, is 
obtained from (22.77) as 


and we have 


d/| = (l — h + cos cuty2h x cos cut + 1 + h + cos cut) dx 2 
= 2 (l h x coscut) dx 2 , 


dl± « V2 ( 1 ^ y coswtj dx. 


Integrating (22.83) over 0 < x < L/s/2., we obtain 


l±(L) ^ L[ 1 y cos a>?j dx. 


(22.82) 


(22.83) 


(22.84) 


From the analysis of the present section, we conclude that the two independent polar- 
izations of gravitational radiation are at an angle of 7r/4 with respect to each other. While 
the effects of a plus-polarized wave {h + ) are along x-axis (or y-axis), the effects of a cross- 
polarized wave are along the lines y = ±x, which are rotated by an angle of 7r/4 relative to 
the x-axis (or y-axis). 

This property of gravitational waves differs from the corresponding property of elec- 
tromagnetic waves, where the two independent polarization directions are at an angle 
of 7r/2 with respect to each other. A deeper analysis of this fundamental difference 
between the electromagnetic and gravitational radiation is beyond the scope of the 
present book. 

Nevertheless, it is worth mentioning that this difference is related to the tensor charac- 
ter of the two fields. While the potential of the electromagnetic radiation (16.1) is a four- 
vector, that is, a rank-one tensor, the potential of the gravitational radiation is the rank- two 
tensor h mn . In the language of quantum field theory the carriers of the electromagnetic 
radiation (photons) are spin-one particles, while the carriers of the gravitational radiation 
(gravitons) are spin-two particles. 
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22.5 Generation of Gravitational Radiation 


In the previous two sections, we discussed the solutions to the linearized Einstein field 
equations that describe the gravitational radiation in vacuum (22.40). In the present 
section, we relate the perturbation tensor h mn to the sources of the gravitational radiation 
T mn , and study the solutions of the more general wave equation (22.39), 

( V2 - ^ ^ T mn . (22.85) 


In analogy to the result (17.50) for electromagnetic potential, we can readily write down 
the solutions to (22.85) as follows: 


hmn (Vfi 



Tmn( x ° ~ R> V*) 
R 


dV, 


(22.86) 


where the integral is over the domain V where the sources of the gravitational radiation 
are distributed. The quantity R in (22.86) is the three-dimensional distance between the 
position of the sources and the position where the potentials are calculated, that is, 


R = Jlx 01 - Vq 2 . (22.87) 

We also introduce the notation r=( \x a \ 2 ) 1 ^ 2 for the three-dimensional distance from 
the origin of coordinates to the position where the potentials h mn are calculated. Let us 
now assume that the spatial extent of the sources is negligible compared to the distance 
between the source and the position where the gravitational field potentials are calculated, 
that is, that |x" | » |x“| (a = 1, 2, 3). Then by definition (22.87), to the lowest order, we have 
R^ r and the solution (22.86) can be written as 

h mn (x°,xf) = — f T mn (x° - r,x a )dV. (22.88) 

c 4 r Jv 


Here we no longer distinguish between h mn and h mn , since we are using the TT-gauge 
where the two tensors are equal to each other. In order to calculate the integral on the 
right-hand side of (22.88), we refer to the energy-momentum conservation law, as defined 
in (17.58) for the case of electromagnetic fields, that is 

d m T mn = d 0 T 0n + d a T an = 0. (22.89) 


Differentiating (22.89) with respect to x°, and letting n = 0, we obtain 

d$T 00 = -d a (d 0 T 0a ^ , 

where we used the symmetry of the source energy-momentum tensor ( T mn 
the other hand from (22.89) we also have 

90 7 °“ = -dpT? 01 , 

such that 

9 0 2 r 00 = -da (-dpT^) = dadpT^, 


(22.90) 
T nm ). On 

(22.91) 


(22.92) 
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Multiplying (22.92) by x p x q and integrating over the source domain V, we obtain 


I=d$ J dVxPtf T 00 = / dVd a d B T aft x p x q . 


L 


(22.93) 


Integrating by parts twice, using Gauss theorem (10.47) and dropping the surface integrals 
as being equal to zero, as indicated below, we obtain 


7 = 


[ dVd a (dp T a P x p xd) - f dVdp T aP d a (x p xd) 
Jv Jv 

j>_ dn a dp T af> x p xd - J dVdp T af) (sffl + &2x p ) 


[ dVdp 
Jv 


T qP x p 


= -[ d Vdp T pP x q - 
Jv 

= - [ d Vdp (T pP x p ) + f dVT p P~dp (x?) 

Jv Jv 

- [ dVdp (T pP x p ) + f dVT^dp ( x p ) 

Jv Jv 

= ~j>_ dft/i T pP xl + J dv T pP S 

- (j) df^ T qP x p + J dy T qf> & p = 2 J dv T pq . 


Substituting (22.94) into (22.93), we obtain 


! f dVT pc < = f dVx p x q T 00 , 
Jv Jv 


or in our specific case 


2 J dV T mn (x" -r,x a ) = d^ / dV x m x n r uu (x u - r,x“) 


-s. 


(22.94) 


(22.95) 


(22.96) 


Let us now recall the expression for the energy- momentum tensor of a matter distribution 

(19.72), 


j'mn 


(p+pc 2 ) u m u n -g mn p. 


(22.97) 


In the linearized case (g uu ~ 1) and in the non-relativistic limit (u ~ 1), we have T' 
such that 


-oo 


per, 


! dV T mn (x° - r,x a ) = -^ J_^dV x m x n p(x° - r, x“) 


where we used the identity 


0 C 2 dt 2 dt 2 ' 


(22.98) 


(22.99) 


Introducing here the quadrupole mass moment by definition 


KH 


= / dVx m x n p{t- -,x“), 


( 22 . 100 ) 
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we obtain 


2 f d V T mn (x 



( 22 . 101 ) 


J v 


where the double dots over I mn denote the second- order time derivative. Substituting 
(22.101) into (22.88), we finally obtain 



( 22 . 102 ) 


From the result (22.102) we see that the lowest-order contribution to gravitational radia- 
tion is the quadrupole radiation. This is in agreement with the earlier discussion about the 
carriers of the gravitational radiation (gravitons) being spin-two particles. 

This also indicates that, similarly to the electromagnetic radiation, there is no 
monopole radiation of the gravitational waves. However, unlike the electromagnetic 
radiation, there is no dipole radiation of the gravitational waves either. Both monopole 
and dipole mass moments contributions to the gravitational radiation must vanish due to 
the energy-momentum conservation. 


PART 5 


Elements of Cosmlogy 





The Robertson-Walker Metric 


In the previous two chapters we have shown that the general theory of relativity provides 
the solutions for the space-time structure created by any given matter distribution. Thus, 
if we can specify the average distribution of matter in the entire universe, the general 
theory of relativity provides the solution for the average space-time structure of the entire 
universe. The study of such a solution for the average space-time structure of the entire 
universe is a part of the subject of cosmology. 


23.1 Introduction and Basic Observations 


In the present chapter we study the phenomena on the cosmological scale. For that 
purpose we need to develop a suitable model of the universe and to make suitable 
assumptions about the physical processes that are dominant on the cosmological scale. 
The two basic observations that allow us to address the large-scale structure of the 
universe are the expansion of the universe (the Hubble law) and the cosmic microwave 
background radiation. 

The first basic observation that allows us to address the large-scale structure of the 
universe is the discovery that the spectral lines of the distant galaxies are shifted towards 
the red end of the spectrum. If we interpret this red shift as a Doppler shift, this observation 
leads to the conclusion that all the distant galaxies are receding from us. Thus, we conclude 
that the universe as a whole is expanding. This expansion implies a finite age of the 
universe. In other words, by observing the sky we can only see stars that are close enough 
for the radiation originating from them to reach us in such a finite time. 

It has been shown by Hubble that the velocity of the distant galaxies v increases linearly 
with their distance r. This proportionality is known as the Hubble law. Using simple 
Euclidean geometry, the Hubble law can be formulated as follows: 


v = Hr, 


H = (55 ± 7) 


km 1 
s Mpc’ 


(23.1) 


where the quantity H is called the Hubble constant. The Hubble constant is a constant in 
a sense that it does not depend on the magnitude or the direction of the vector r. However 
it may depend on time and the numerical value given in (23.1) is its present-time value. 
According to the Hubble law the universe is in the uniform expansion, which means that 
there are no privileged positions in the universe and that the observer traveling with any 
galaxy sees the surrounding galaxies as receding from him. If we observe from earth two 
different distant galaxies Gi and G 2 , then according to the Hubble law their respective 
velocities observed from our galaxy are given by 
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vi=Hn, V2=Hr 2 . (23.2) 

The relative velocity of the galaxy Gi as observed from the galaxy G 2 , denoted by V, is then 
obtained as follows: 

V = v\ -v 2 = H (j] - ? 2 ) = HR, (23.3) 

where R = r\ — is the relative distance between the two galaxies Gi and G 2 . Thus, 
the observer traveling with galaxy G 2 sees the galaxy Gi and indeed any other galaxy 
as receding from him. Although we have used non-relativistic approximation in the 
Euclidean geometry and ignored the time dependence of the Hubble constant, the above 
general conclusions are nonetheless valid, that is, the observer traveling with each galaxy 
sees all the other distant galaxies as receding from him. 

Since the distant galaxies are receding from us with a velocity directly proportional 
to their distance, there must be a point at which each of them will approach the speed 
of light and the relativistic effects will become dominant. The set of all those points 
is called the world horizon. The radius of the world horizon is approximately equal to 
rwH = c/H = 2 x 10 10 lightyears, if the present-day value of the Hubble constant is used. 
The galaxies which are located at or beyond the world horizon are invisible to us. 

The second basic observation that allows us to address the large-scale structure of 
the universe is the discovery of the extremely isotropic cosmic microwave background 
radiation. This radiation has the same intensity in all directions with a precision better 
than one part in a thousand. Regardless of the origin of this radiation, this observation is a 
convincing evidence for the assumption that any acceptable model of the universe has to 
be an isotropic model. 

Another interesting feature of the microwave background radiation is its relatively high 
energy density. It is by far the component of the diffuse radiation in the universe with 
highest energy density. The universe today is matter-dominated, but there are reasons to 
believe that the universe was radiation- dominated in the early phases of its history and 
that the observed microwave background radiation originates from these early phases of 
the history of the universe. 

23.2 Metric Definition and Properties 

The basic observations described in the previous section indicate that the large-scale 
structure of the universe is both homogeneous and isotropic. This assumption is often 
called the cosmological principle. We imagine that the matter in the universe is on the 
large-scale evenly distributed in the form of a cosmic fluid, with no shear-viscous, bulk- 
viscous and heat-conductive features. This is a good approximation of the actual universe 
as long as we take the large-scale point of view. Clearly, on the smaller scale, the matter in 
the universe is unevenly distributed and the universe is highly non-homogeneous. 

We denote an observer at rest with respect to the cosmic fluid as a fundamental 
observer. As the universe expands, the cosmic fluid takes part in the expansion and the 
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fundamental observer is co-moving with the fluid. Every co-moving observer in the cosmic 
fluid sees the same isotropic and homogeneous image of the universe. The objective of the 
present section is to define the space-time metric in the co-moving frame of reference. We 
cannot use the static Schwarzschild metric to describe the expanding universe, which is 
certainly not static. We need a non-static, homogeneous and isotropic metric to describe 
the entire universe. 

Let us take the proper time measured by the fundamental observer co-moving with 
the cosmic fluid as the time coordinate, which we here denote by w. In such a case the 
space-time metric has the following general form: 

ds 2 = c 2 die 2 + 2cgo a dwdx 01 + g a p dx a dxC (23.4) 


Applying now the condition of isotropy and spherical symmetry to the metric (23.4), we 
obtain 


ds 2 = c 2 die 2 — D(r, w) dr 2 — cE(r, w) dr die 

— F(r, w) (d$ 2 + sin 2 9d<p 2 ^ . (23.5) 

In (23.5) it should be noted that we have chosen the dimensionless radial coordinate 
r = 71/ L, where TZ is the usual radial coordinate with the dimension of length and L is a yet 
unspecified reference length of the universe. The functions D(r, w ) and F(r, w) therefore 
have the dimension of the square of length and the function E(r, w) has the dimension of 
length to secure the proper dimension of the metric (23.5). Let us now use the geodesic 
equations of a particle given by (11.32), in the form 


d ii j 

ds 


l - d Mu l u k . 

2 dxi 


For a particle at rest in the co-moving frame of reference we have 


= 1 , u a = 0 (a = 1,2,3). 


(23.6) 


(23.7) 


Substituting (23.7) into (23.6) and using goo = c 2 , we obtain 


d_ ui 
ds 



d 

ds 


(gpU 0 ) 


Thus, we obtain 


d w dgj 0 _ l 9g 00 
ds dw 2 dxi 


dgjo 

dw 


= 0 


- — E{r, w) = 0. 
dw 


(23.8) 


(23.9) 


From (23.9) we conclude that E — E(r) is not a function of time coordinate w, and the 
metric (23.5) becomes 


ds 2 = c 2 dw 2 — D(r, w) dr 2 — cE(r) drdw 
—F(r, w) ( do 2 + sin 2 9 d <p 2 ^ . 


(23.10) 
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The term proportional to dr d w can be eliminated by introducing a new time coordinate 


t = w — — / E(r ) dr =>■ d w = dt + —E(r) dr. 


hi 


2c 


Using (23. 1 1) , we may write 


and 


c 2 dw 2 = c 2 dt 2 + cE(r) drdt + - [E(r)] 2 dr 2 , 

4 


—cE(r) drdw = —cE(r) drdt — - [E(r)] z dr 2 . 


Substituting (23.12) and (23.13) into (23.10), we obtain 

ds 2 = c 2 dt 2 — A(r, t ) dr 2 — B(r, t) (^dd 2 + sin 2 6 d <p 2 \ 

where 


gn = A(r, t) = D\r, w(r, t)] + ^ [E(r)f , 


(23.11) 


(23.12) 


(23.13) 


(23.14) 


(23.15) 


g 22 = B(r, t) = F[r, w(r, t)\. 

The functions A(r, t) and B(r, t ) defined by (23.15) have the dimension of the square of 
length to secure the proper dimension of the metric (23.14). In order to further specify the 
functions A(r, t) and B(r, t), we now apply the condition of homogeneity of the space-time 
and consider the following coordinate transformation: 

z" = x" +s a (r,9,(p) (or = 1,2,3). (23.16) 

Using the initial assumption that e° = 0 and (23.16) we obtain 

dz° 


dxP & dxP ’ dxP 


= 0 . 


(23.17) 


The general transformation law for the covariant metric tensor, defined with respect to the 
systems of coordinates {x k } and \z k ) by g kn and g /7 respectively, is given by 

dzi dz l _ 

8kn ~ dx* dx" Sil ' 

Using (23.17) and (23.18) we may write 


(23.18) 


_ dz v dz a _ 
8afl ~ dFdrf gv ‘ 


Substituting (23.17) into (23.19) we obtain 






dx a 


ds a \ _ 


(23.19) 


(23.20) 


Expanding the expression (23.20) and dropping the term quadratic in the derivatives of 
the small translational parameters e", we obtain 

de v ds v 

g a ft(x k ) = g a p(z k ) + —g V f>(z k ) + -^g va {z k ). 


(23.21) 
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The homogeneity of space-time now requires that the metric is invariant with respect to 
the transformation (23.16), that is, that we have 

gafi(z k ) = g a p(z k ) (a, p = 1, 2, 3). (23.22) 

Substituting (23.22) into (23.21), we obtain 

ds v ds v 

gap (X k ) = g a p ( Z k ) + — ( Z k ) + J^gva ( Z k ). (23.23) 

On the other hand, expanding g a p (z k ) into the Taylor series, we may write 

g a p(z k ) = g a p{x k ) + (23.24) 

Substituting (23.24) into (23.23) and dropping the terms quadratic in the small transla- 
tional parameters e" and their derivatives, we see that only the zeroth-order term of (23.24) 
contributes to the first-order equation (23.23). Thus, we obtain 


lr - K d& V , K d& V , fox 
gap (X k ) = g a p(Z k ) + — g v p(X k ) + ~^g va (X k ), 


or 


3e v 


3e v 


g a p(z k ) = - ^g v p(x k ) - ^g va (x k ). 


(23.25) 


(23.26) 


Using again the homogeneity condition (23.22) and comparing (23.26) and (23.24) with 
each other, we obtain 


3 gap 


3 e 


ds v 


gV + ^S V p(X k ) + 7-zgva(X k ) = 0. 


Fora = P = 1 and using the metric (23.14) with (23.15), (23.27) becomes 


3 A , 3s 1 

— s 1 +2 A= 0. 

dr dr 


For a = 1 and p — 2 and using the metric (23.14) with (23.15), (23.27) becomes 


de 2 n 3s 1 „ 

B-\ A = 0. 

dr dd 


Finally for a = P — 2 and using the metric (23.14) with (23.15), (23.27) becomes 


3 B , de 2 
— s 1 +2 — B= 0. 
dr dd 


(23.27) 


(23.28) 


(23.29) 


(23.30) 


From the differential equations (23.29) and (23.30) it is possible to eliminate e 2 . We first 
rewrite (23.29) and (23.30) as 


de 2 _ 3 e 1 A 
~dr ~ ~d8B’ 


de 2 

~de 


1 3 B 
2B~dr 


e 1 = y r (in B-^s 1 . 


(23.31) 


From (23.31) we may write 
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2 „2 


d 2 e 


drdO 


9 2 e 1 A 

~d Wb’ 


d 2 e 2 


drdO dr 2 V / dr V / dr 


Addition of the two equation (23.32) gives 
a 2 g 1 A d 2 


(in B- 1 ' 2 ) e 1 + - (inB- 1 ' 2 ) 
dd 2 B dr 2 V / 9r V 7 

Using now (23.28) and (23.30), we may also write 

18A 9 , . 2 9s 1 

A dr dr ' ' D ar ’ 


9s 1 

~9r 


= 0 . 


9r 


1 9B 9 , . 2 9g 2 

= — (InB) = — r — . 

R dr v ' pi dfl 


(23.32) 


(23.33) 


(23.34) 


B dr dr K ’ g 1 dd 

As the small translation parameters e" are by definition independent on the time coordi- 
nate t, we conclude that 9,-(ln A) and 9 r (ln B) are not the functions of the time coordinate t 
either. But the functions A(r, t) and B(r, t) are dependent on the time coordinate t. This is 
only possible if the functions A(r, t ) and B(r, t ) are factorized as follows: 


A(r, t) = a(r)R 2 (t), B(r, t) = b(r)R z {t), 


(23.35) 


where R(t) is some yet unspecified function of the time coordinate t which has the dimen- 
sion of length. This function in some sense plays the role of the radius of the universe. The 
function R(t) should not be confused with the Ricci scalar R used in the gravitational field 
equations, and whenever there is a risk of confusion the distinction between the two will 
be explicitly stated. On the other hand we note that the dimensionless radial coordinate r 
is not uniquely defined. The metric (23.14) is invariant with respect to the transformation 
from the radial coordinate r to some new radial coordinate p, defined by 


r= I F(p)dp 


dr = F{p) dp, 


(23.36) 


where F(p) is some arbitrary function of the new radial coordinate p. Indeed, if we 
substitute (23.36) into the metric (23.14), we obtain 

ds 2 = c 2 d t 2 — A{p, t ) dp 2 — B(p, t) (d6 2 + sin 2 6 dy 2 ^ , (23.37) 


where 


gn = A(p, t) = A 
g22 = B(p, t) = B 


J F{p) dp, t 
J F{p) dp, t 


[F(P)V , 


(23.38) 


The metric (23.37) has the same form as the metric (23.14) and they are fully equivalent to 
each other. Thus, we are free to make an arbitrary choice of the form of one of the functions 
a(r) or b{r). If we set b(r ) = r 2 , the metric (23.14) becomes 
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ds 2 = c 2 df 2 — R 2 (t)a(r) dr 2 — f? 2 (t)r 2 (df? 2 + sin 2 9 di p 2 ^j . (23.39) 

From the metric (23.39) we see that the surface area of a sphere with the relative radius r is 
equal to 47r r 2 R 2 (t). If we, at the conveniently chosen time instant t, have R(t) = L, then the 
surface area of a sphere with the relative radius r = 7 Z/L is equal to Att'R 2 and the angular 
coordinates 6 and cp become the usual spherical angular coordinates. In order to specify 
the function a = a(r), we use (23.28) and the factorized form of the function A(r, t) given 
by (23.35) to obtain 

sM-£K lfl )-£K 1 ' 2 )- ,23 - 40 > 

The solution of the differential equation (23.40) can be written as 

s\r,e,ip) = e(9,(p)a~ 1/2 (r). (23.41) 


Substituting now the factorized form of the functions A(r, t) and B(r, t), given by (23.35) 
with b(r ) = r 2 , into (23.33), we obtain 

d 2 s 1 


3 e 2 


3 2 , , , 3 , x Se 1 

= — (lnr) e H (Inn — , 

n dr 2 ' 2 dr ' dr 


(23.42) 


or 

d 2 s 1 _ r 2 / 1 9g i 1 1 \_r 2 d / g 1 \ 

3 9 2 a \r dr r 2 - 8 ) a dr \ r ) ' 

Substituting the solution (23.41) into (23.43) we further obtain 

13 z e _ r 2 3 / 1 \ _ 

ed 9 2 a l l 2 dr\ra 1 / 2 ) 


(23.43) 


(23.44) 


The expression on the right-hand side of (23.44) is a function of the angular variables 
6 and <p only, while the expression on the left-hand side is a function of the radial variable 
r only. Thus, both sides of (23.44) are equal to a constant which we denote by C. In order 
to determine the value of C, we use (23.44) to write 


de 2 


= Ce. 


(23.45) 


If the translation (23.16) is chosen to be along the polar axis (z-axis) in the corresponding 
Descartes coordinates, we have e(0, <p) ~ cos 6. This determines the value of the constant 
C to be C = —1. Thus, we obtain the differential equation for the function a = a{r) in the 
form 


a 1 ! 2 dr \ r a 1 ! 2 ) 


(23.46) 


The solution of the differential equation (23.46) is 


a(r) = 


1 

1 -kr 2 ' 


(23.47) 


It is easy to verify by direct calculation that (23.47) satisfies the differential equation (23.46) 
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dr y r 

Substituting (23.47) into (23.39) we obtain the final result for the Robertson-Walker metric 
in the form 

ds 2 = c 2 dt 2 - R 2 (t) (^j~Yr 2 + r 2 dd 2 + r 2 sin2 6 d <P 2 ) > (23.49) 

where R(t) is a dynamic function of the time coordinate t with the dimension of length, 
which will be calculated as a solution to the gravitational field equations, and k is the 
so-called curvature constant, which describes the geometry of the three-dimensional 
space at any particular time instant. The positive values of the curvature constant ( k > 0 ) 
correspond to the three-dimensional space with a positive curvature, the zero value 
( k = 0) corresponds to the flat three-dimensional space and the negative values ( k < 0) 
correspond to the three-dimensional space with a negative curvature. The value of the 
curvature constant k can always be taken as +1, 0 or —1 by a suitable rescaling of the 
radial coordinate r. 

The infinitesimal element of the proper distance between two arbitrary galaxies given 
by edit can be calculated from the condition ds = 0 using the Robertson-Walker metric 
(23.49) as follows: 

dr 2 

cdt = R(t)da, da 2 = — =- + r 2 d6 2 + r 2 sin 2 6 dip 2 , (23.50) 

1 - kr l 

where da 2 is the metric of the three-dimensional space for a fixed value of the time 
coordinate t which is usually called the cosmic time. The cosmic time is a universal time 
equal for all observers at rest with respect to the local matter. Using (23.50) we may write 
c At = R(t)Ao for finite proper distances. All measurements are made at the same epoch 
t. The radial coordinate r is a co -moving coordinate and it remains fixed for each galaxy. 
The angular coordinates 6 and (p also remain fixed for the isotropic motion of each galaxy. 
Thus, the spatial metric da 2 remains fixed and the proper distance between two galaxies 
is only scaled by the function R(t) as the cosmic time t varies. The function R(t) with the 
dimension of length is therefore called the scale radius and it increases or decreases as the 
universe is expanded or contracted respectively. 


= - 1 . 


(23.48) 


23.3 The Hubble Law 

In the present section we discuss the Hubble law in the framework of the Robertson- 
Walker geometry. It has been concluded before that each galaxy has similar coordinates 
(r, 6, cp). Let us now assume that our own galaxy, being an approximately co-moving 
object, lies at the spatial origin r — 0 and that some other distant galaxy lies at some radial 
coordinate distance r. Due to the homogeneity of space-time this choice of coordinates 
does not place our own galaxy in the center of the universe, since any galaxy can be chosen 
to have r = 0. This particular choice is only made for convenience. The proper radial 
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distance D r between our galaxy at r = 0 and the distant galaxy, at some radial coordinate 
distance r and at a given cosmic time t, can be calculated using the Robertson-Walker 
metric as 

f r dr 

D r = R(t) . == ■ (23.51) 

Jo \J\ — kr z 

The integral (23.51) is elementary and gives 

I R(t) arcsin r (k = 1) 

R(t) r (k = 0) (23.52) 

R(f)arsinhr (k = — 1) 


where it should be noted that r = 7 Z/L is the dimensionless radial coordinate. Thus, the 
proper distance is proportional to the scale radius R(t) which changes with time. Keeping 
in mind that the radial coordinate r is a fixed co-moving coordinate, the proper velocity 
is obtained by differentiating the proper distance D r with respect to the cosmic time t, 
that is, 

v r = D r = Rit) f dr = ^D r , (23.53) 

Jo VI -kr z R 

where the dots denote the time differentiation. Thus, we obtain the Hubble law saying 
that at any given cosmic time t the speed of any distant galaxy relative to our own galaxy 
is proportional to its proper distance from our galaxy. The Hubble constant is given by 


Hit) = 


m 

R(tY 


(23.54) 


and we see that the Hubble constant is indeed a function of the cosmic time as we 
have anticipated earlier in this chapter. The quantity Ho measured by the astronomers 
is the value of the Hubble constant at the present epoch, that is, for t = to- It should 
be noted that the proper distance is not a directly measurable quantity and it can 
only be measured indirectly by measurements of some other quantities like the red 
shifts. 


23.4 The Cosmological Red Shifts 

In the present section we discuss the cosmological red shifts as one of the means to 
measure the proper distances to the distant galaxies and test the validity of the Hubble 
law in the framework of the Robertson-Walker geometry. Let us consider a distant galaxy 
at some relative radial coordinate distance r = r d emitting two light wave crests at cosmic 
times t d and t d + At d towards our own galaxy situated at r = 0. The two wave crests are 
received in our galaxy at cosmic times to and to + A to- For the radial motion of light the 
Robertson-Walker metric gives 

dr 2 

1 -kr 2 


ds 2 = c 2 df 2 - R z (t) 


= 0 . 


(23.55) 
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Thus, we may write 


df 1 dr 

R(t) c f 1 — k r 2 


(23.56) 


Since the beam of light is moving towards us, the radial coordinate r decreases as the time 
coordinate t increases along the null geodesic and it is appropriate to use the minus sign 
in (23.56). Thus, we may integrate (23.56) to obtain 



dr 

m 


1 C r * dr 

c Jo \! 1 — k r 2 


(23.57) 


and 


rto+Ato df 
t d +At d R(t) 


1 C Td dr 

c Jo V 1 — kr 2 


(23.58) 


For all types of radiation received from distant galaxies the time intervals A t d and Afo are 
tiny fractions of a second and over that time R(t) remains effectively constant. Subtracting 
(23.57) from (23.58) we obtain 

= o ^ = W _ (2 3. 59) 

R(to) R(t d ) A t d R{t d ) 

The observed wavelength Xo and the emitted wavelength X d are related to the time 
intervals A to and A t d by the following definitions: 


= cdAto, X d = cAt d . 


(23.60) 


Thus, the red shift of the received light waves can be obtained in terms of the function R(t) 
as follows: 


Xo— X d _ R(to) 
X d R(t d ) 


(23.61) 


In the expanding universe we have R(to) > R{t d ) and the red shifts is positive in agreement 
with the empirical observations. The red shift (23.61) is a consequence of the light traveling 
in the curved space-time and it is not a result of the Doppler effect. This red shift is called 
the cosmological red shift. Most observed cosmological red shifts are rather small and t d is 
relatively close to to. It is therefore possible to expand R{t d ) in a Taylor series around to as 
follows: 

R(t d ) = R(t 0 ) + (t d - t 0 )R(t 0 ) + i(t d - t 0 fR(to) + ... (23.62) 


or 


R(t d ) = R(t 0 ) 


1 + Ho(t d — to)R(to) 


J^qoHldd -t 0 ) 2 + ■■■ 


(23.63) 


where Ho is the present value of the Hubble constant given by 


H 0 


R(to) 
R(to) ’ 


(23.64) 
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and qo is a dimensionless deceleration parameter given by 

R{to)R{to) 
q °~ fr(to) 


( 23 . 65 ) 


The deceleration parameter qo is positive when R(t ) is negative, that is, when the expan- 
sion of the universe is slowing down. Substituting (23.63) into (23.61) we obtain 


z 


1 + Ho(t d - to)R(to) - -qoHQ(t d - to) 


( 23 . 66 ) 


or 


z 


Ho(to — t d ) + 


( 1+ ^°) 


H'oCo ~ td) 2 - 


( 23 . 67 ) 


This formula is sometimes very useful but we must keep in mind that it is only valid for 
small cosmological red shifts where t d is relatively close to to. 

When we observe a galaxy with a red shift z= 1, it means that the scale radius R{t d ) 
of the universe at the cosmic time t d when the radiation was emitted was one half of 
the present scale radius R(to). In other words the size of the universe at the time t d was 
a half of the present size of the universe. Unfortunately we do not know the cosmic time t d 
when the radiation was emitted. If we did, we could directly measure the function R(t). We 
therefore need some theory of the cosmic dynamics in order to determine the scale radius 
R(t). Such a cosmic dynamics theory is the subject of the following chapter. 



The Cosmic Dynamics 


24 


The non-static models of the universe based on the Robertson-Walker metric (23.49) 
are described by their scale radius R(t ) and the curvature constant k. The analysis in 
Chapter 23 did not determine the scale radius R(t ) as a function of the cosmic time t. 
The knowledge of the function R(t) is essential for determining the rate of expansion of 
the universe and other physical properties of the expanding universe. In order to find the 
solution for R(t), we need a theory of cosmic dynamics based on the gravitational field 
equations. We therefore combine the homogeneous isotropic Robertson-Walker metric 
with the gravitational field equations to obtain the dynamic equations satisfied by the 
scale radius R(t). These equations are called the Friedmann equations. 


24.1 The Einstein Tensor 


In the present section we use the Robertson-Walker metric (23.49) given by 

dr 2 


ds z = c 2 df 2 — R 2 (t) 


1 - kr 2 


+ r 2 d8 2 + r 2 sin 2 9d<p‘ 


(24.1) 


The covariant metric tensor for the Robertson-Walker metric is given by the following 
matrix: 


[gm«] — 


10 0 0 

0 -R 2 ( 1 - kr 2 )- 1 0 0 

0 0 —R z r 2 0 

0 0 0 —R 2 r 2 sin 2 9 


(24.2) 


The contravariant metric tensor for the Robertson-Walker metric is then given by the 
following matrix: 


[g mn ] = 


10 0 0 

0 — R -2 (l — kr 2 ) 0 0 

0 0 -R~ 2 i- 2 0 

0 0 0 — R~ 2 r~ 2 sin -2 9 


(24.3) 


Using the matrix (24.2) we may write 


gOO = 1, gll = -: 


R 2 


1 — kr 2 ’ 

g 22 = —R 2 r 2 , g 33 = —R^r 2 sin 2 9. (24.4) 

The coordinate differentials of the metric tensor components (24.4) can be calculated as 

3fcgoo = 0 (k = 0, 1, 2, 3), 
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2RR 21crR 1 2 

= drgn= -(l-kr 2 ) 2 ’ 

degn = dpgn = 0, 
dog22 = —2 RRr 2 , d r g 22 = —2 R 2 r, 
dffg 22 = dpg 22 = o, 

9og33 = —2 RRr 2 sin 2 9, d r g 33 = —2R 2 rsin 2 9, 

9eg33 = -2R 2 ! 2 sin<9 cos<9, 8^33 = 0. (24.5) 


In the results (24.5) and in the following calculations, we temporarily define by dots the 
derivatives of the scale radius R(t) with respect to the temporal coordinate x° = cl rather 
than with respect to cosmic time t to simplify the calculations and to make the results 
compatible with the results obtained elsewhere in the literature using the units with c = 1. 
Thus, we have 


m = 


d R(t) 
dx° 


1 d R(t) 
c d t 


(24.6) 


The Christo ffel symbols of the first kind for the metric (24.1) can now be calculated using 
the definition (9.28), that is, 


_ 1 / 9gjfc 8g n j 8gkn\ 
J,kn “2V 8 x 11 + 8 x k dxi ) ' 


(24.7) 


The results for the Christo ffel symbols of the first kind for the metric (24.1) are summarized 
in the following list: 


To , 00 = -Oogoo + 9ogoo - 9ogoo) = 0, 

1 

To, 01 = To , 10 = -( 8 ,-goo + 9oglO - 9ogOl) = 0 , 

1 

To , 02 = E 0,20 = -(9flg00 + 9og20 - 9ogo2) = 0, 

1 

To, 03 = E 0,30 = -( 8 <pg 00 + 9og30 — 9og03) = 0, 

1 RR 

To , 11 = -(drgoi + 9 r gio - 9ogll) = + ^ 2 , 

1 

To , 12 = E 0,21 = -(9flg01 + 9rg20 — 9ogl2) = 0, 

1 

To, 13 = E 0,31 = -(9^goi + 9 r g30 - 9ogl3) = 0, 

1 . 2 

To , 22 = -(9flg02 + 9eg20 - 9og22) = +RRl ' 

1 

To, 23 = E 0,32 = — (d(pg 02 + 9flg30 - 9og23) = 0, 

To, 33 = 2 (9<pgo3 + 9 P g30 - 9og33) = +RRr 2 sin 2 9, 
Ei, oo = -(9ogio + 9ogoi - 9;-goo) = 0, 
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r 1,01 = Tijo = -(3|'gl0 + 9ogll - 3rgbl) = — Y T~2 ’ 

1 

T 1,02 = T 1,20 = -(3flglO + 9og21 - drgoz) = 0, 

1 

T 1,03 = T 130 = -(dpgio + 9og31 - 9|-g03) = 0, 

1 R 2 kr 

r 1.11 = 2^rgn + 9,'gn - 3,'gn) = - (1 __ kr2)2 , 

1 

T 1,12 = Ti,21 = -(3flgll + 3rg21 - 9rgl2) = 0, 

1 

T 1,13 = T 131 = — (3<pgii + 9 r g3l - 9rgl3) = 0 

1 

T 1,22 = -(3flgl2 + 3flg21 - 3rg22) = +R T 

T 1,23 = Ti, 32 = “(93gl2 + 9flg31 — 9,g23) = 0, 

T 1,33 = — (9<jogi3 + 3»>g3i - 9rg33) = +-R 2 rsin 2 0, 

1 

T 2,00 = -(3og20 + 9og02 - dggoo) = 0 , 

1 

T2,oi = T 2,10 = -(3rg20 + 3ogi2 - 3egoi) = 0, 

1 . , 

T2.02 = T2.20 = -(9flg20 + 9og22 - 9flgo2> = —RRr , 

1 

T2.03 = T2.30 = -(d<pg20 + 9og32 - 9(lg03) = 0 , 

1 

T2.11 = — (9rg2i + 3rgi2 - 3flgn) = 0, 

1 2 

T 2,12 = T2.21 = — (3flg21 + drg22 - 9(1 gl2) = —R 

1 

T2.13 = T2.31 = — (3<pg21 + 9rg32 - 9(1 gl3) = 0. 
r 2,22 = “(9flg22 + de g22 — 9flg22) = 0, 

T2.23 = T2.32 = “(9<pg22 + 9(1 g32 - 9(1 g23) = 0 

T2.33 = ^(9^g23 + d<pg32 - 3d g33) = +R 2 r 2 sin(9 COS (9, 

1 

T3.00 = -(9og30 + 9og03 - d<pgoo ) = 0, 

T3.01 = T3,10 = ~(drg30 + 9ogl3 - 9^g0l) = 0, 

1 

T3.02 = T3,20 = “(9(lg30 + 9og23 — 9^go2> = 0, 

T3.03 = T3,3o = — (dipg30 + 9og33 — 9^go3> = -RRr 2 sin 2 0, 
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T 3 .ll = -OrgSl + 9rgl3 - 9?gll) = 0, 

1 

T3.12 = T 3 ,21 = -(9flg31 + 9rg23 - 9pgl2) = 0, 

T3.13 = T 3 , 3 i = -(9^g3i + 9 r g33 - 9pgi3) = —R 2 rsin 2 9, 

1 

T3.22 = 2 (9flg32 + 9eg23 - d<pg22) = 0, 

T3.23 = T 3,32 = 2 (9^g32 + 9flg33 - 9pg23) = -R 2 r 2 sin0 COS <9, 

T3.33 = 2 (^£33 + 9pg33 - 9pg33) = 0. (24.8) 

The Christoffel symbols of the second kind for the metric (24.1) can be calculated using 
the definition (9.29), that is, 

r L = S Plr lkn- (24.9) 

The results for the Christoffel symbols of the second kind for the metric (24.1) are 
summarized in the following list: 

Too = g° J T;,oo = g 00 To,oo = 0, 

r 0 i = T?o = g° J T ;> oi = g 00 To,oi = 0, 

r 0 0 2 = r 2 °o = g 0j r 7 -,02 = g 00 r 0 ,02 = o, 

r 0 °3 = r 30 = gS-,03 = g 00 r 0 ,03 = 0, 

T?1 = g 0 ; Ty, 11 = g 00 r 0 ,ii = + . 

r?2 = r°i = g 0 J rj, 12 = g 00 r 0 ,i2 = o, 
r ? 3 = r°i = g 0 J rj,i 3 = g 00 r 0 ,i 3 = o, 
r 22 = g 0Jr J, 22 = g°° T 0,22 = +RRr 2 , 

r 23 = T 3 °2 = g^Tj, 23 = g° 0 T 0 , 23 = 0, 
r 33 = g° ; Tj,33 = g°° T 0,33 = +RRT 2 sin 2 9 
Too = g^T /,00 — g 11 Ti, 00 = 0 

r oi = r} 0 =g 1J T/, oi =g n T 1>0 i = +^, 

Tq 2 = r 20 = g li n 02 = g 11 r 1 ,02 = 0, 

r 03 = T30 = g lj r ; -03 = g n Ti,03 = 0, 

Til = g 1; T/,n = g u Ti,ii = +Y~k^' 

r i2 = r 2! =g 1 J T 7 -, 12 =g n ri,12 = 0, 
r i 3 = r 31 = g 1 - / r / , 13 =g n r U3 = o 

r 2 1 2 = g 1; Tj ,22 = g u r 1,22 = -r (l - fcr 2 ) 
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r 23 = r 32 = g 1Jr j, 23 = g U ri,23 = 0, 

T 33 = g 1] r l33 = g 11 r 1 ,33 = -r (1 - fcr 2 ) sin 2 9, 

Too = g 2j rj,oo = g 22 r 2 ,oo = 0 , 
r oi =* r io = 8 2Jr j , 01 = g 22 r 2 ,oi = o, 

I® = r 20 = g 2 jr 7,02 = g 22 r 2 ,02 = 
r 03 = r 30 = g 2 jr 7 , 0 3 = g 22r 2,03 = 0 

rfi = g 2] 'rj,u = g 22 r 2 ,n = o 

r i2 = r 21 = g 2jr 7',12 = g 22 r 2 ,12 = +J, 

r i 3 = r 31 = g 2Jr ;, 13 = g 22 r 2 ,13 = 0 , 

r | 2 = g 21 0,22 = g 22 r 2 ,22 = 0 , 

r|3 = r 32=g 2j r;,2 3 = g 22 r 2 ,23 = 0 , 

r 33 = g 2 ; Tj ,33 = g 22 T2,33 = - sine cose 

Too = g 3 ; r J -,oo = g 33 r 3 ,oo = 0 , 
r oi = r io = g 3Jr j, oi = g 33 r 3 ,oi = o, 
r 02 = r 20 = g 3 -' r /,02 = g 33 r 3 ,02 = 0 , 

To3 = r 30 = g 3jr 7,03 = g 33 r 3 ,03 = + 
r?! = g^r j,n = g 33 r 3 ,n = o, 
r i 2 = r 21 = g 3 j r ;, 12 = g 33 r 3 ,12 = 0 , 

r i3 = r 31 = g 3jr 7,13 = g 33 r 3 ,13 = +- r , 

T 3 2 = g 3 ; 'r J -,22 = g 33 r 3 ,22 = 0 , 

r 23 = r 32 = g 3jr ;,23 = g 33 r 3 ,23 = ^ = ™te, 
r 3 3 3 = g 3 ; 'r J -.33 = g 33 r 3 ,33 = 0. 

From the results listed in (24.10) we can make the following conclusions: 

f 1 o = o, r° k = r £ 0 = o, 


r % = -R&p, rf 0 = r 0 ^ = ^| 

where (a, ft = 1, 2, 3). Thus, we further obtain 

r^=0, r^ = o for Oi + p, 

r^=5i5 2 i+5 2 5 2 cote fora^ 


(24.10) 

(24.11) 

(24.12) 


(24.13) 
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Using the results (24.12) and (24.13) we note that 


9)4 = ° for a^p. 


d P d P T aO = O' 


Furthermore, we can calculate the following sums: 


r J — r° -i- r 1 -i- r 2 -i- r 3 — -us — 
1 — 1 oo + 1 oi + 1 02 + 1 03 — 


4 = r? n + rj, + r? 2 + r? a = 


0/ 
rJ 

V 

rJ 

4 

4 = r 30 + r 31 + f 32 + *33 = 0 . 


3 — 
If 

kr 


— ^20 + ^21 + ^22 + ^23 — COt0, 


13 

p3 
1 23 

r 3 
1 33 


2 

U2 + r’ 


Using the results (24.15) we note that 


(24.14) 


(24.15) 


9^ = 0 for ff/ft 9 0 rU = 0. (24.16) 

The Ricci tensor for the metric (24.1) can be calculated using the definition (19.8), that is, 

Rkn = dn r j kj - 9,4 + r p kj r j pn - r p n r j pj . (24.17) 

From the result (12.48) we see that the Ricci tensor is a symmetric tensor R *■„ = R and 
that it has only ten independent components. The formulae for the components of the 
Ricci tensor for the metric (24.1) are summarized in the following list: 


R 00 = do r j oj - djPjjg + r p r J p0 - r * 4 ’ 
r 01 = Rw = 9r r lj - + r p oj r j pl - r p 0 l r J pj} 

r 02 = r 20 = dg r J oj - djr J 02 + r p 0 j r j p2 - r p 2 r j pj , 

*0 3 = *30 = d,r ] 0j - 9 l r 03 + r 0) r p 3 - r fl 3 4’ 

r u = a r r{. - 9,4 + 4.4 - 44, 

*i 2 = *21 = 9 e r{ ; . - 9,4 + 44 - r p 2 r j pj , 

* 13 = *3i = 9 ,r{. - 9 /t{ 3 + 44 - 44, 

*22 = 9<4 - 9,4 + 44 - r p 2 r ] pj , 

*23 = *32 = d v r{j - 9 4 + *2)4 “ 44- 

*33 = 9,4 - 9)4 + 4 4 - 44 . (24.18) 

The results for all ten individual components of the Ricci tensor for the metric (24.1) can 
be obtained from the above list (24.18) . However, the calculation process can be facilitated 
using the general results (24.11) - (24.16). Thus, fora, ft = 1, 2, 3 and a / p, we may calculate 

R aP = dpiij - 9 / 4 + 44 - 44. 


(24.19) 
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From (24.14) and (24.16) we see that the first two terms on the right-hand side of (24.19) 
for a / vanish and we may write 

^ ^ *i] - r i- - r i" ( 24 - 2 °) 

Using here = 0 and F^ = 0 for a ± ft from (24.13), we obtain 

Raf> = r P aj r j pp for a^fi. (24.21) 

Thus, the second term on the right-hand side of (24.21) is not equal to zero only for (a/J) = 
(12). Let us now calculate the first term on the right-hand side of (24.21), as follows: 

r aj r U = r aO r p/S + rLr^j, (24.22) 


or 


Y'P pi pO pO I p(T pO I pO p£t) I p£T ptt 

^ mi nft *■ /vfu O/J ^ ryf) ^ rrR ' * cun* f)/? ' ^ cr 


(24.23) 


It is shown by direct calculation that the first three terms on the right-hand side of (24.23) 
vanish. Thus, we obtain 

Y'P pi per ptt) pi ptu I p 2 ptt) I p 3 p 

1 aj l pP 1 &<’> nR 1 a( ’> 1 ' 1 auo 1 2R ' 1 ao> 1 


1 aco 1 aP 


a co L 2 p + 1 aco L 3 p 
1 r 3 , t- 2 ml , m2 t-2 


pi pi i pi p2 I pi p3 i p2 pi i p2 p2 I p2 p3 

— 1 al A ip + 1 a2 l 1 p 1 a3 l ip + 1 al 1 2p 1 a2 A 2p + 1 «3 A 2p 

I p3 pi i p3 p2 I p3 p3 
1 al 1 3£ + 1 «2 a 3£ + 1 <*3 A 30- 

Using the result (24.24) we can show by direct calculation that 


cote 


Y'P pi ?1 c2 p3 p3 ol o2 

1 aj [ pP ~ d a d p [ 13 1 32 — °a°p ~~ r 

Substituting (24.25) into (24.21) we finally obtain 


(24.24) 


(24.25) 


Rap = Rfia = 0, (fit (24.26) 

Furthermore, for a = 1, 2, 3, we may also calculate 

Ra0 = 9o r j aj - 9 0 r° 0 - d p T^ + r^r£ 0 - r^.. (24.27) 

Using here the results (24.11), (24.14) and (24.16), we see that the first three terms on the 
right-hand side of (24.27) vanish. We then have 

R ° 0 = r a0 r p0 + T ap r p0 ~ ^0 Kj ~ ^pj- ( 24 - 28 ) 

Using further (24.11), the first and the third term on the right-hand side of (24.28) vanish. 
Thus, we obtain 

Ro 0 = c r^ 0 + r v afi rf 0 - rf 0 r° 0 - rf 0 r^ v . (24.29) 

Using once again the results (24.11), the first and third term on the right-hand side of 
(24.29) vanish. We may then write 


D pV pP pP pV 

ft-aO 1 ap l v 0 1 aO 1 Pv‘ 


(24.30) 
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Using the result (24.24) we can show by direct calculation that 

R a0 = 0, (a = 1,2,3). 


(24.31) 


From the results (24.26) and (24.31) we conclude that all off-diagonal components of the 
Ricci tensor are identically equal to zero 


Rkn — Rnk — C (Jc ^ VL). 


(24.32) 


Thus, the only nontrivial components of the Ricci tensor are the diagonal components. 
Using the results (24. 1 1)— (24. 16) , they can be calculated as 

Roo = a 0 r^. + r£rf, 0 

= 33o (£) + (Foj) 2 + (r 0 2 2 ) 2 + (r 0 3 3 ) 2 = 3^ 
r u = a r H. - d 0 r° n - arrjj + rf ; ii - r°, ri - rj, r{ 


= 3 r 


kr 2 

rr^ + 7 1 ” 90 


i j p i 
RR 

1 — At 2 


11* o j 

-a, 


o' i / 
kr 


1 — At 2 


2 R 2 

+ - CT + 


3 R 


f?2 


; z r z 2 
1 — kr 2 ' (1 — kr 2 ) 2 r 2 1 — kr 2 


kr 


l - kr 2 \1- kr 2 
- o 


Arr 


+ ?)--T 


R 2 


7? 2R 2 + 2 A; 

- + 


- At 2 \R R 2 
l w 


R 22 = a*r'. - a 0 r 2 ° 2 - a r r] 2 + r p 2 j r J p2 - r° 22 r J 0j - r’ 2 r- ly . 

= dg (cote) - 9o ( RRi - d r r (l - Ar 2 ^ + 2R 2 r 2 

— 2 (l — Ar 2 j + cot 2 e — 3 R 2 r 2 + 2 — kr 2 

y y (R 2R 2 + 2k 

= - r ^{r + ^- 

R 33 = -aor 3 ° 3 - aula - a fl r| 3 + - r° 3 r^ - r’ 3 r^. 

— r| 3 r{ ; . = —3o ( RRr 2 sin 2 e^ — d r r (l — Ar 2 ^ sin 2 ej 


i (— sin 9 cos 9) + 2R 2 r z sin 2 6 — 2 | 


sin 2 9 


:(l-A;r 2 ) 

2 cos 2 9 — 3 ifr 2 sin 2 9 + (2 — kr 2 ^ sin 2 9 + cos 2 9 


= -R 2 r 2 sm 2 9 ( - + 
R 


R 2 R 2 + 2k 


R 2 


(24.33) 


Multiplying the covariant components of the Ricci tensor (24.33) by the corresponding 
components of the contravariant metric tensor given by (24.3) we obtain the components 
of the mixed Ricci tensor as follows: 
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K = r^22 = s + 


i? 2 R 2 + 2k 


*3 = 8 33 R 33 = ^ + 


i? 


R 2 ’ 
2R 2 + 27c 


ft 


The Ricci scalar R- \ is then given by 


R 


R = 3- +3 - + 


R 


R 2 R 2 + 2 k 


R 


R 2 


R 2 


, R R 2 + k 
_6( R + — W~ 


(24.34) 


(24.35) 


Let us now form a mixed tensor entering the gravitational field equations (19.97) as 


ck p k 

\2Tn 


-8*Rj = 
2 n 1 


8jtG T k 
C 4 1 rr 


(24.36) 


The tensor is usually called the Einstein tensor. Substituting the results (24.34) and 
(24.35) into the definition (24.36), we obtain the components of the Einstein tensor for 
the Robertson-Walker metric as follows: 


r' 0 pO 

^ 0—^0 


R 


-R J . =3- -3 - + 


2 J 


R 


R R 2 + k 


R 


R 2 


= -3 


R 2 + k 


G\ = G 2 2 = Gl=R\--R]=- + 


R 2 R 2 + 2k 


3--3 

R 


R 2 + k 
—tf— 




R 2 

R 2 + k 
— 


(24.37) 


At this stage it is appropriate to reverse the temporary convention (24.6) and to return to 
the usual definition of dots as the derivatives of the scale radius R(t) with respect to the 
cosmic time t, that is, 

d R(t) 


RU) = 


d t 


(24.38) 


This change of convention introduces an additional factor of 1/c before each derivative 
defined by the dots. Thus, the results (24.37) become 

3 R 2 + kc 2 


G 0 — 


R 2 


1 ( R R 2 + kc 2 


G}=GS = Gf = — 3 (2- + 


R 2 


(24.39) 


Equations (24.39) are the final results for the components of the Einstein tensor in the 
Robertson-Walker metric. 


24.2 Friedmann Equations 

In order to construct the gravitational field equations (24.36) we now need the energy- 
momentum tensor T* of the cosmic matter. The assumption is that the matter in the 
universe is on the large-scale evenly distributed in the form of the cosmic fluid with no 
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shear-viscous, bulk-viscous and heat-conductive features. Thus, we may use the mixed 
energy-momentum tensor of an ideal fluid, which is in the covariant form given by (19.72) . 
We may then write 

T n = (p + pc 2 ) u n u k - S k p, (24.40) 

where p is the pressure and p is the matter (or rest energy) density of the cosmic fluid. In 
the co-moving frame the cosmic fluid is at rest and we have 

u° = 1, u a = 0 (a = 1,2,3). (24.41) 

Using the results (24.41) and observing that p pc 2 , we obtain 

Tg = pc 2 , T] = rf = rf = -p. (24.42) 


Thus, the energy-momentum tensor of the cosmic fluid in the co-moving frame can be 
structured in the following matrix: 



pc 2 0 0 0 

0 — p 0 0 

0 0 — p 0 

0 0 0 —p 


(24.43) 


Substituting the results (24.39) and (24.42) into (24.36), we obtain the gravitational field 
equations in the form 


if + kc 2 8 j rG 
R 2 = ~^~ p 


(24.44) 


R if + kc 2 

2 r + ~r^~ 


87 rG 


(24.45) 


Equations (24.44) and (24.45) are called the Friedmann equations and their solution 
describes the cosmic dynamics. Combining (24.44) and (24.45) we may write 


R 87 rG 87rG 

+ 


Furthermore, from (24.44) we have 


R 2 + kc 2 = 


87rG 


-pR 2 . 


(24.46) 


(24.47) 


Differentiating (24.47) with respect to the cosmic time t, we obtain 

87rG 9 87 rG 

2RR = ~^~PR + ~^~P2RR, 


or 


R 87 rG R 87 rG 

2 R = ^ P R + ^ 2P - 


Substituting (24.49) into (24.46) we obtain 


87rG R 8n G 

+ -r 3p - 


87 rG p 


3 ^, 


(24.48) 


(24.49) 


(24.50) 


Chapter 24 • The Cosmic Dynamics 235 


or 

^ + 3 (p+J) = °- (24.51) 

Observing that the universe as a system of galaxies in the smooth fluid approximation 
behaves as an incoherent dust, we can again use the approximation p pc 2 to obtain 

' + 3 'rreM-°- (24 - 52 > 

Integrating (24.52) we obtain 

pi? 3 = p 0 i?g = Constant, (24.53) 


where po and Rq are the matter density and scale radius at the present epoch {t = to). 
From the result (24.53) we see that the matter density varies in time as R~ 3 and that the 
quantity of matter in a co-moving volume element is constant during the expansion of the 
universe. This conclusion is relevant to the present matter-dominated epoch in the history 
of the universe. 

In the early phases of its history when the universe was radiation- dominated this 
conclusion was not valid because at that stage the assumption p < pc 2 was not valid either. 
In the radiation- dominated universe equation (24.51) becomes 


P + 




(24.54) 


and the pressure component cannot be neglected. Using (24.53) and neglecting the 
pressure, we obtain the Friedmann equations for the matter- dominated epoch of the 
universe as follows: 


R R 2 + kc 2 
2 R + R 2 


= 0 , 


(24.55) 


R 2 + kc 2 8itGpo f?n 

= (24 - 56) 

The solutions of the Friedmann equations for the matter-dominated epoch of the universe 
given by (24.55) and (24.56) will be the subject of the following chapter. 




Non-static Models of the Universe 


In Chapter 24, we have derived the cosmic dynamic equations satisfied by the scale 
radius R(t), known as the Friedmann equations. As we have concluded earlier, the non- 
static models of the universe based on the Robertson-Walker metric (23.49) are described 
by their scale radius R(t) and the curvature constant k. The objective of this chapter is 
therefore to study the appropriate solutions for the scale radius R(t) and the appropriate 
values of the curvature constant k and their implications for the future evolution and other 
physical properties of the expanding universe. 


25.1 Solutions of Friedmann Equations 

In order to solve the Friedmann equations (24.55) and (24.56), we first introduce some 
useful quantities. Let us recall that the Hubble constant (23.54) is given by 


mt) = 


m 

my 


Substituting (25.1) into (24.56) we obtain 


2 kc 2 8nGpo Rq 
H = 

In the present epoch (t = to) with R — Rq and H = Hq, (25.2) becomes 

2 kc 2 8n Gpo 


or 


8jrGpo 




Hi 

6 


8tvG 

- -j- = (Po - Pc) < 


3c 2 c 2 3c 2 

where p c is the critical (or closure) matter density of the universe, defined by 

3 H 2 


Pc = 


8nG 


(25.1) 


(25.2) 


(25.3) 


(25.4) 


(25.5) 


With the range of the estimated values of the Hubble constant Hq in the present epoch, 
the numerical value of the critical density is given by 


pc = 2 x 10 _26 fl 2 -%, 0.5 < a < 1. 

m 6 


(25.6) 
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Let us also recall the definition of the deceleration parameter q(t) defined according to 
(23.65) as follows: 


q(t) = 


mm 

R 2 (t ) 


1 R 
RPR' 


(25.7) 


From the Friedmann equations (24.55) and (24.56) we may write 

if 47rGpo Rq 


R 


R 3 ’ 


Substituting (25.8) into (25.7) we obtain 


4,t Gpp flg 
q{ ) 3 H 2 if 3 ' 

The present-epoch value of the deceleration parameter (25.9) is given by 

4 tt Gpp _ _po_ 

3 H 2 ~ 2 pc' 


do = 


(25.8) 


(25.9) 


(25.10) 


From (25.10) we see that the present-epoch value of the deceleration parameter can be 
expressed in terms of the present and critical matter densities of the universe. 


25.1.1 The Flat Model (k = 0) 


For the flat model with k = 0, (25.4) gives po = Pc and from (25.10) we find qo = 1/2. Using 
p 0 = p c and (25.5) we may write 


2 8jrGpo 

tin = . 

0 3 

The Friedmann equation (24.56) then becomes 

b2 SjrGpoffo ^0*0 

R = 3R = ~JT 
or 

if 2 = ^, a = h 0 r 3 0 /2 . 

Equation (25.13) can be rewritten as follows: 


if = = AR-b 2 => J VRcLR = aJ df+ C. 


(25.11) 


(25.12) 


(25.13) 


(25.14) 


Performing the elementary integration in (25.14) we obtain 

2 -^ 2 =At+C => R(t) = 0 ^ 3 t 2/3 , 


(25.15) 


where we used the initial condition R( 0) = 0 to set the integration constant C in (25.15) 
equal to zero. From (25.15) we may also write 


t 


2 r,,, 2 

— if 3 / 2 = 

3A 3 H 0 



(25.16) 
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The present-epoch equation (25.16) for t = to has the form 

2 


to = 


3//n ' 


(25.17) 


The solution (25.15) for the scale radius R(t) with the choice of the curvature constant 
k — 0 is in the literature sometimes called the Einstein-de Sitter solution. 


25.1.2 The Closed Model (k = 1) 

For the closed model with k = 1, (25.4) gives po > pc and from (25.10) we find qo > 1/2. 
The Friedmann equation (24.56) then becomes 

C?2 Q-n-n^ C>3 


R 2 i 8jtGpoRq B 
~C Z+1 ~ 3 c 2 R = R’ 


(25.18) 


where we introduce a constant B as 

B = 


8ttGpqRq _ 2^ n Gpo HqR o 


3c 2 


3 Hi 


(25.19) 


Using the definition of the present-epoch deceleration parameter (25.10) we obtain 


B = 2qo 


tt2 p 3 


Furthermore, using (25.4) for k = 1, we may write 




R 2 0 c 2 


3 H 2 


{2q 0 - 1) , 


or 


Ro = wJ 2qo ~ 1 )~ 1/2 - 


Substituting (25.22) into (25.20) we obtain 


B = 


2qo 


(2q 0 - if 2 Ho' 

Equation (25.18) maybe rewritten in the form 


Id R _ B-R 
c At V R 


[ { cdt = f 1 
Jo Jo 


R 


B-R 


-AR. 


Let us now introduce an angular parameter q as follows: 


with 


■? r] B 

R = Bsin 2 - = — (1 — cos n) , 
2 2 


AR = Bsin - cos -dn. 
2 2 


Using (25.25) and (25.26) we may write 


R y q B 

^—j^dR = Bsi rf -Aq = - (1 - cos q) dq. 


(25.20) 


(25.21) 


(25.22) 


(25.23) 


(25.24) 


(25.25) 


(25.26) 


(25.27) 
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Using ( 25 . 25 ) and performing the elementary integration in ( 25 . 24 ) with ( 25 . 27 ), we obtain 
the two parameter equations 

B B 

R = — (1 — cos ri ) , ct = — (ij — sm rj ) . (25.28) 


The parameter equations ( 25 . 28 ) define the scale radius R(t) by a parameter rj. The 
function R(t) defined by ( 25 . 28 ) is a cycloid and the scale radius is a cyclic function of 
the cosmic time. The universe starts expansion at the cosmic time t = 0 [rj = 0) with 
scale radius R = 0 and expands to a maximum size R = B at the cosmic time t = t\J2 
( r) = n). Thereafter the universe is contracted back to the scale radius R — 0 at the 
cosmic time t = q (rj = 2tc). The time t l in the closed model is called the lifespan 
of the present universe. It can be calculated from ( 25 . 28 ) with ( 25 . 23 ) for rj = 2n, as 
follows: 


B nB 2nqo 1 
th 2c n c ( 2 q 0 _ i) 3/2 H 0 ' 


(25.29) 


For example, the choice q 0 = 1 gives the lifespan of the universe equal to /| = 27t/Hq. 
From ( 25 . 22 ) and ( 25 . 22 ) we also note that for q 0 = 1, the present scale radius Rq = c/Hq 
is equal to one half of the maximum scale radius B = 2c/Hq. It means that in the closed 
model for q 0 = 1, the present universe will expand to twice its present size before it starts 
contracting. 


25.1.3 The Open Model (k = -1) 

For the open model with k = — 1, ( 25 . 4 ) gives po < Pc and from ( 25 . 10 ) we find qo < 1 / 2 . 
The Friedmann equation (24.56) then becomes 


R 2 _ 8jt GpoRq _ B 


(25.30) 


c- 1 3 c 2 R R’ 

where we again use the constant B given by ( 25 . 23 ). Equation ( 25 . 30 ) may be rewritten in 
the form 

r R 


Id R _ B + R 

c dt V R 


f cdt = f 
Jo Jo 


R 


B + R 


-d R. 


Let us now introduce an angular parameter ?/ as follows: 


(25.31) 


with 


R = B sinh 2 - = — (cosh u — l) , 


Y1 T] 

d R = B sinh - cosh -d n. 

2 2 


Using ( 25 . 32 ) and ( 25 . 33 ) we may write 


d R = Bsinh 2 -d;; = — (cosh;; — l) d;;. 
B + R 2 2 


(25.32) 


(25.33) 


(25.34) 
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Using (25.32) and performing elementary integrations in (25.31) with (25.34), we obtain 
the two parameter equations 


B B 

R = — (cosh r] — l) , ct = — (sinh r) — ri) . 


(25.35) 


The parameter equations (25.35) define the scale radius R(t) by a parameter rj. The 
function R(t) defined by (25.35) is growing indefinitely as L — > oo (77 — > oo). Like the flat 
Einstein-de Sitter solution, the open solution continues to expand forever. It should also 
be noted that the expansion of the universe in the open model is faster than that of the flat 
model due to the presence of the exponential functions in the parameter ij. 


25.2 Closed or Open Universe 


The analysis of the three possible models for the expansion of the universe, presented in 
Chapter 24, does not resolve the question whether the present universe is open or closed. 
The answer to that question must be looked for in the astronomical observations and 
estimates of the various parameters of the model. The current estimates of the present- 
epoch Hubble constant Hq and the critical density p c are 


H 0 


Pc 


= 100a- 


km 1 


Mpc’ 


= 2 x 10“ 26 a 2 ^|, 

rrw 


0.5 < a < 1. 


(25.36) 


The estimates of the present-epoch deceleration parameter qo are more difficult to obtain. 
In order to study the deceleration parameter qo, we use the result for the Hubble constant 
H = R/R to calculate the second derivative of the scale radius with respect to the cosmic 
time as follows: 


R = HR =>■ R = HR + HR = H 2 R + HR. 


(25.37) 


Substituting (25.37) into (25.7) we obtain 

1 R ( H \ 
q{t) -~JpR--{lfi +1 )- 

Using (25.38) we can calculate the present-epoch deceleration parameter qo as 


t25 - 39 > 

The result (25.39) indicates that if we can measure H(to) and H (to) we can calculate the 
present-epoch deceleration parameter qo . In order to measure H(to) we use the fact that as 
we look deeper into space at the same time we look farther back into time. For example, if 
we estimate the Hubble constant H(t) for objects one billion light years away, we are really 
estimating the Hubble constant H(t) for the universe one billion years back in the cosmic 
time. The difficulty with such a method of determining the present-epoch deceleration 
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parameter qo is related to the difficulties to measure the distances to the objects deep in 
space. 

In spite of these difficulties, some existing observational data suggest the value of the 
present-epoch deceleration parameter qo = 1. As we have concluded before, for qo > 1/2, 
the universe is closed and such observational data suggest that the present universe is 
closed. However, using (25.10) for qo = 1, we obtain the following present-epoch matter 
(or rest-energy) density of the universe 

J^g 

p 0 = 2p c = 4 x 10 _26 fl 2 -|, 0.5 < a < 1, (25.40) 

irP 

which is much larger than the observed density of matter in the universe. This discrepancy 
is the origin of the so-called problem of the missing mass in the universe. The problem was 
identified by measurements of the masses of clusters of galaxies in two different ways. One 
method was to exploit a definite relation between the luminosity of a galaxy and its mass, 
to sum up all the masses of member galaxies to obtain the total mass of the cluster. The 
other method was to measure the relative velocities between the member galaxies and 
to calculate the mean relative velocity which is determined by the mass of the cluster. It 
was discovered that the luminosity mass obtained by the first method was considerably 
smaller than the dynamical mass obtained by the second method. 

Thus, there is a reason to believe that there is a large amount of invisible matter within 
the observed clusters of galaxies. The first decisive evidence of the existence of such dark 
matter in the universe came from the rotation curves of galaxies. By measurements of the 
rotation curves of smaller galaxies revolving around great spiral galaxies it was discovered 
that they differ completely from the curves expected from the Newtonian mechanics in 
the empty space. The only possible interpretation of this result is that the space around the 
galaxies is not empty. On the contrary, it seems to consist of a considerable amount of dark 
matter. It is today generally believed that up to 90the form of the high-temperature ionized 
gas emitting X-rays, and X-rays have indeed been detected in the clusters of galaxies. 
However the observed density is again far below the density required to account for the 
large amount of the dark matter in the universe. There are enormous numbers of neutrinos 
in the present universe. Some experiments indicate that they may have a rest mass of the 
order of 10 -35 kg, and if this is true the neutrinos may just be the missing matter. This is 
a growing area for combined efforts of astrophysics and particle physics today, although 
it is still not clear how the neutrinos could impact the rotation curves of galaxies in the 
observed way. 

25.3 Newtonian Cosmology 

In this section, we study the evolution of an expanding universe within the framework of 
the Newtonian theory of gravitation and compare the results with those obtained using the 
general theory of relativity. Let us imagine the large-scale matter-dominated Newtonian 
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universe as an ideal fluid with negligible pressure moving according to the Newton laws 
of motion and gravity. Such fluid is often called the Newtonian dust. We now consider 
the Newtonian dust with a uniform density p(f) being in the state of uniform expansion. 
The only force acting on the dust particles is the force of gravity. The components of the 
three-dimensional position vector of an arbitrary dust particle at some cosmic time t are 
given by 

x“(f) = R(t)k a {a = 1, 2, 3), (25.41) 


where X is a constant vector defined by the initial position of the dust particle, and R(t) 
is the scale radius of the uniform expansion. The first and second time derivatives of the 
coordinates (25.41) are given by 


= i?r = -x a = hx 01 , = i?r = -x a , (25.42) 

R R 

where H(t) = R/R is a Hubble constant of the Newtonian expansion. Let us now use the 
continuity equation (19.54) in the form 

^ + d a (pd*) = + x“3 a p + p 3q.x“ = 0. (25.43) 

Using (25.42) and the definition of the total time derivative of the density p, given by 


P 


we may rewrite (25.43) as follows: 


dp dp dp dx 01 
df dt dx a dr 


(25.44) 


p + pHd a x f = 0 =>■ p + 3 pH = 0, 


(25.45) 


or 

' j+3 r-s4(' ,B3 )-° (25 - 46) 

Integrating (25.46) we recover the result (24.53) obtained using the Friedmann equations, 
that is, 

pR 3 = po-Rg = Constant, (25.47) 


where po and Rq are the matter density and the scale radius at the present epoch [t = to). 
From the result (25.47) we see that in the Newtonian cosmology the matter density also 
varies in time as R~ 3 . The isotropy and spherical symmetry of the universe require that any 
spherical volume evolves only under its own influence. If the observed spherical volume 
has the radius r and mass M(r) = (4jt/3)i 3 p, the equation of motion of a dust particle 
somewhere on its surface is given by the Newtonian equation (18.44), as follows: 


■■a R a GM(r) 

X 01 = -x“ = 5 — J 

R C 

As (25.48) is valid for an arbitrary x", we obtain 


4?r Gp 


x“. 


(25.48) 


3 
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R=-^R. 


Using now the result (25.47), (25.49) becomes 

47rGpoRo 


R= -- 


3R 2 


Multiplying now both sides of (25.50) by R, we obtain 

4jrGpoRn R 
RR + — = 0, 


or 


R 2 


1 d R 2 4tvGpoRq d / 1 


2 dt 


dt\R 


- = 0 . 


Thus, we may write 


A (if _ 8?r GpqRq \ = 
dty 3 Rj 


(25.49) 

(25.50) 

(25.51) 

(25.52) 

(25.53) 


Integrating (25.53) we obtain 


R 2 


ZxGppRl = _ 2 

3 R 


(25.54) 


where kc 2 is the integration constant. After some restructuring of (25.54) we recover the 
second Friedmann equation (24.56), that is, 


if + kc 2 8jrGpo R 3 0 

R2 = § ^3' 


Furthermore, using the result (25.54) rewritten as 

47rGpoRg 1 R 2 + kc 2 
3R 2 = 2 R 


Equation (25.50) becomes 


1 R 2 + kc 2 
R = 2 R 


(25.55) 


(25.56) 


(25.57) 


Dividing by R and restructuring of (25.57) we recover the first Friedmann equation (24.55), 
that is, 


R R 2 4- kc 2 
2 R + — R2 


= 0. 


(25.58) 


From the above discussion we conclude that the Newtonian cosmology gives the same 
dynamic equations for the scale factor R(t) of the expanding universe as the relativistic 
cosmology based on the Robertson-Walker metric. In the Newtonian cosmology the 
parameter k is just an integration constant, while in the relativistic cosmology k is the 
curvature constant. In fact if k^0 there is no loss of generality if we set k — ±1 in the 
Newtonian cosmology either. The dynamic equations (25.55) and (25.58) lead to exactly 
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the same three models of the expanding universe with k — — 1, 0, 1. It should however be 
kept in mind that the Newtonian cosmology is just a non-relativistic limit of the standard 
relativistic cosmological model and that it cannot account for a number of important 
physical features of the expanding universe. In particular it cannot handle the radiation- 
dominated early phases of the expanding universe and incorporate the pressure of the 
cosmic fluid in the proper way. 




The Quantum Cosmology 



The objective of the present chapter is to give a short introduction into the most profound 
speculations about the earliest moments of the present universe. In the solutions of Fried- 
mann equations, presented in Chapter 25, we have generally used the initial condition 
R( 0) = 0. In effect such an initial condition means that in the earliest moments of its 
evolution, the present universe may have been so compact that its size was comparable 
to the size of a single quantum particle. In such an early epoch the classical solutions of 
the Friedmann equations, derived in Chapter 25, are no longer adequate and we need 
a quantum theory of the very early universe, that is, the quantum cosmology. Since the 
quantum cosmology is a highly speculative theory in its early development phase, the 
presentation in this chapter will be limited to a few introductory topics. Furthermore, as 
the quantum theory is outside the scope of the present book, a few elementary quantum- 
mechanical results needed in the present chapter will be introduced without a detailed 
derivation from the first principles. 

26.1 Introduction 

The non-static models of the expansion of the universe, discussed in Chapter 25, are 
based on a well-formulated gravitational field theory and available observational data. 
These models can be used to study the early moments of the universe down to the times 
t < 10 -2 s. The subject of the present chapter is to investigate the so called Planck epoch, 
that is, the times t < 10 -43 s. 

The approach pursued in the present chapter is to use the canonical quantization 
procedure to quantize the gravitational degrees of freedom and to derive the Klein-Gordon 
wave equation for the wave function of the universe governing both the matter fields and 
the space-time geometry. Such wave equation is generally known as the Wheeler-De Witt 
equation. The wave function of the universe in this approach is a function of all possible 
three-dimensional geometries and matter field configurations, known as superspace. It 
should be noted that the concept of superspace used in quantum cosmology has nothing 
to do with the concept of superspace of the supersymmetric quantum theories. 

In order to reduce the problem to a manageable one, all but a finite number of 
degrees of freedom are frozen out, leaving us with a finite -dimensional superspace known 
as the mini-superspace. In the present chapter we consider a simple mini-superspace 
model in which the only remaining degrees of freedom are the scale radius R(t) of a 
closed homogeneous and isotropic universe and a homogeneous massive scalar field (p. 
Furthermore, all the degrees of freedom of the scalar field are also frozen out, and the 
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only effect of the scalar field is to provide the vacuum energy density p v ac . Such a vacuum 
energy density contributes to a cosmological constant term defined by A = 87rG/>vac- 
Using the mini- superspace model described above we then calculate the semi-classical 
wave functions of the universe and the tunneling probability for the universe to make the 
transition from R — 0 to the transition point R = Ro, being the upper limit of the classically 
forbidden region 0 < R < Rq. 


26.2 Wheeler-De Witt Equation 


Let us start the derivation of the Wheeler-De Witt equation by recalling the Friedmann 
equations of cosmic dynamics, given by (24.44) and (24.45), that is, 


R 2 + kc 2 8 tzG 

R2 - ~3~ p 


( 26 . 1 ) 


R R 2 + kc 2 

2 r + ~r 2 


8jtG 
2 “ 


( 26 . 2 ) 


In the very early phases of its evolution, the universe was clearly not matter- dominated. In 
fact it is generally believed that in the Planck epoch it was not even radiation-dominated, 
and the energy- momentum tensor was determined by the vacuum energy. In such a 
vacuum-dominated epoch the pressure of the extremely dense matter of the universe is 
given by p = —pc 2 . Using (24.54), we see that in the vacuum-dominated epoch we have 


p = 0 =>■ p = pv ac = Constant. 


( 26 . 3 ) 


Thus, the Friedmann equations (26.1) and (26.2) in the closed model (k — + 1) become 


R 2 + c 2 8 nG A 

R 2 = _ § _Pvac “ 3 " 


( 26 . 4 ) 


R R 2 + c 2 


2 r + 


R 2 


= 87T Gpvac = A. 


Substituting (26.4) into (26.5) we obtain 


R A R A 

2 R + 3 =A ^R = 3' 


Equation (26.6) can be written as follows: 


R - —R= 0, 


R 0 = c x — . 
A 


The classical solution of (26.7) is given by 

R(t) = Ro cosh ■ 


( 26 . 5 ) 


( 26 . 6 ) 


( 26 . 7 ) 


( 26 . 8 ) 


This solution describes a universe which was infinitely large (/? -»• oo) in the infinite past 
(f — > — oo). Then it has been contracted to a minimum size of Ro at l = 0 after which it 
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starts expanding again to an infinite size in the infinite future (f — > +oo). From (26.8) we 
see that the classical solution has a forbidden range for 0 < R < Rq. Thus, it is classically 
impossible for the universe to start from R = 0 and evolve into the above cosmic model. 
However, given the quantum mechanical nature of the problem in the Planck epoch, it 
may be possible for the universe to make such a transition with some finite quantum- 
mechanical tunneling probability. In order to calculate the wave functions of a universe 
starting at R = 0 and tunneling into the region R> Rq and the tunneling probability of such 
a transition, we now construct the action integral of the universe in the Planck epoch as 
follows: 

h = [*(*).•&(»)] J v ^ drde d <P- (26-9) 

where 


y^gdrde dtp = R 3 


r 2 dr 

VT^T 2 


sin6» d0 dtp. 


(26.10) 


In (26.9) we have used the result for the Ricci scalar (24.35) in the Robertson-Walker metric 
with k = + 1, that is, 



/ R R z + c 2 \ 

\ R + ~R —)’ 


(26.11) 


to conclude that the Lagrangian density C [R(f),R(f)] in the present model is space 
independent. Thus, we can integrate out the spatial part of the action integral (26.9) , which 
is just a the three-dimensional spherically symmetric volume in the Robertson-Walker 
curved space-time metric (23.49), to obtain 


r r 1 7* 2 d/' C n 

/ yJ^gdrdddp = R 3 / ; / sin0d0 / d <p (26.12) 

Jv Jo VI — r 2 Jo Jo 

Integrating the elementary integrals over 6 and cp and introducing the new angular variable 


X as 


r = sin x => dr = cos x dx . 


(26.13) 


the result (26.12) becomes 

f /■*/ 2 

/ y/^g dr d8 d<p = 4n R 3 / sin 2 / dx = 2n 2 R 3 . (26.14) 

Jv Jo 

Substituting (26.14) into (26.9) we obtain 

I G = -^ J t d tR 3 (t)£[R(t),R{t)]. (26.15) 

Using the result for the Ricci scalar (26.11) in the Robertson-Walker metric with k = +1, 
we construct the Lagrangian density of the present model as follows: 


c[R(t),m] 


6 

c 2 




(26.16) 
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where we have eliminated the term including the second- order time derivative, which 
should not appear in the Lagrangian density, by reducing it to the total time differential. 
Substituting (26.16) into (26.15), we obtain 

f 3 nc 2 

Ig = j d tUt) = 


4 G 


where the Lagrangian L(t ) is given by 


L(t) = - 


3jnr 

4G 


J^dt (RR 2 - Rc 2 + ^R 3 ^) 

)■ 


(26.17) 


RR 2 - Rc 2 + - R :i 


dR d t\dh) °' 


The Lagrange equation for the scale radius R(t) is then given by 

dL d fd L 


)• 


Substituting (26.18) into (26.19), we obtain 


3jtc 2 
" 4G 




R 2 -c 2 + A R 2 - 2 RR - 2 R 2 


or 


R R 2 + c 2 

2 — -f- ~ — — A. 

R R 2 


(26.18) 


(26.19) 


(26.20) 


(26.21) 


Thus, the Lagrangian (26.18) gives the correct equation (26.5) for the scale radius R(t). Let 
us now define the momentum conjugate to the scale radius R(t), as 

3nc 2 


dL 

Pr = Jr = 


2 G 


-RR. 


(26.22) 


Using (26.22) we may express R in terms of pr as follows: 

R = -^ (26 - 23) 

37 rc z R 

Using (26.22) and (26.23) we obtain the Hamiltonian of the present model as follows: 


H = p R R-L = - 


G p\ 3 nc 2 


3nc 2 R 4 G 


Rc 2 - | R 3 


(26.24) 


The Hamiltonian (26.24) is a classical Hamiltonian of the present mini-superspace model. 
The quantum-mechanical Hamiltonian operator H is obtained using the canonical quan- 
tization prescription 


PR 


ih — , i = 
9R 


(26.25) 


The wave equation of the universe in the present mini-superspace model known as 
Wheeler-De Witt equation is then obtained from the condition H — 0 as follows: 


It'll (R) = 


h 2 G d 2 3nc 2 


3nc 2 R dR 2 4 G 


Rc 2 - ^-R 3 


'L(R) = 0, 


(26.26) 
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or 



9 2 

9 # + U( ™ 


W(R) = 0, 


with the potential U(R) defined by 


U(R ) = 


/3^C 3 i? 0 \ 2 

V 2G J 




(26.27) 


(26.28) 


The Wheeler-De Witt equation written as in (26.27) has the familiar form of the one- 
dimensional Schrodinger equation for a particle with zero total energy moving in the 
potential U(R). The form of the potential (26.28) clearly indicates that there is a classically 
forbidden region for 0 < R < Rq and a classically allowed region for R > Rq . The point 
R — Ro is a turning point of the potential (26.28). Classically speaking and using the 
particle analogy, we conclude that a particle at R — 0 is stuck there and cannot travel 
to the region R>Rq. However, in the quantum mechanics there is a finite probability 
that the particle can tunnel through the barrier and emerge at the classical turning point 
R — Rq. Thereafter it can evolve classically in the region R > Rq. 


26.3 The Wave Function of the Universe 


In order to find the solutions of the Wheeler-De Witt equation for the wave function of 
the universe T (R) and to calculate the tunneling probability of the universe through the 
potential barrier (26.28) we restructure (26.27) as follows: 


d 2 ^ ■, 

w + ^ = °' 


(26.29) 


where z — R/Ro is a dimensionless scale variable and the function Q 2 (z) is defined by 


Q^z) = a 2 (z 4 - z 2 ) , 


a = 


37rc 3 .Rg 9 jtc 5 


(26.30) 


*Cz) = 


(26.31) 


2hG 2hGA ’ 

Using the WKB approximation the wave function of the universe T (z) is given by 

2 lVo|Q(.z)r 1/2 exp |w(z)|, z < 1 (R < R 0 ) 
AT 0 |Q(z)r 1/2 cos(|w(z)| - |), z > 1 (R>R 0 ) 

where No is the WKB normalization constant, which is not essential for the present 
analysis. The function w(z) is given by 

. 3/2 


w(z) = j Q(z) dz = | (z 2 — 1^ 

For example in the region where R^> Rq the wave function (26.31) becomes 

'I'(Z) = NqCC ^/ 2 - COS (^■Z 3 — ■ 


(26.32) 


(26.33) 
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and it is a slowly falling periodic function. Let us now turn to the tunneling probability 
for the universe to make a transition from R = 0 to the transition point R = Rq. The most 
general WKB result for the tunneling probability is given by 

-l 


T = 


1 + exp ^2 J Q(z)dz^ = 


2 a 

1 + exp I - — 


Substituting a from the result (26.30) into (26.34), we finally obtain 


T = 


. 3 nc 5 \ 

1 +exp i-scA) 


-,-i 


(26.34) 


(26.35) 


It should be noted that in many quantum-mechanical problems the exponential in the 
expression for the tunneling probability (26.34) is a large quantity. It is therefore customary 
in the literature to use the approximate expression 


T = 


1 + exp 




(26.36) 


However, since the gravitational action integral is a negative quantity, such an approxi- 
mation is invalid in the present discussion and only the more general result (26.34) can 
be used. Even though the energy density pvac and thereby the cosmological constant A of 
the vacuum- dominated Planck epoch are very difficult to estimate, in order to get the idea 
of the orders of magnitude of these quantities let us assume that the quantity a given by 
(26.30) is of the order of unity. This gives the tunneling probability of the order of T ~ 2/3. 
Using now the result (26.30), we obtain 


9jrc 5 

11 ~ 2hGA ~ 

Using now A = 87rGp V ac> we have 

Pvac — 


=>• A 


9jtc 5 
2 hG 


= 4.87 x 10+ 87 


1 


c2' 


A 

8?rG 


2.9 x 10+ 96 . 

ird 


(26.37) 


(26.38) 


The result (26.38) indicates an enormous energy density of the universe in the vacuum- 
dominated Planck epoch. Furthermore, using the result (26.7) we can estimate the radial 
distance of the turning point of the potential (26.28) as follows: 


Rq = ct 0 = cj — 
A 


7.44 x 10“ 36 ] 


(26.39) 


The length scale of this order of magnitude is sometimes called the Planck length. The 
time scale to corresponding to this length scale is then given by 


to = 



2.48 x 10“ 44 s, 


(26.40) 


and it is roughly to < 10 43 s, which is in agreement with our assertion in the introduction 
to the present chapter. 
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A remarkable, and surprising, property of the wave function of the universe is the fact 
that it is time-independent and depends only on the space-time geometry and the matter 
field content. A possible interpretation of the wave function Tfif?) is that it measures 
probabilistic correlations between the geometry and the matter field content. In this 
interpretation it is possible to use some function of the matter fields, for example, the 
energy density of the matter fields, as a surrogate time variable. However neither the role of 
time in the quantum cosmology nor the interpretation of the wave function of the universe 
are fully understood yet. 

We also note that just having the wave equation of the universe does not resolve the 
issue of the quantum evolution of the universe. To be more specific about the quantum 
evolution of the universe, we need information about the initial quantum state. There 
are currently several proposals for such an initial quantum state giving different physical 
predictions. If such predictions can be made to be sufficiently precise, then the present- 
day observations could be used, at least in principle, to test these different proposals for 
the initial quantum state of the early universe. 

The quantum cosmology efforts around the world today are very ambitious and the 
very limited presentation given in the present chapter only gives some flavor of this 
exciting subject. It is our hope that after mastering the material discussed in this chapter, 
an interested reader will be able to proceed to the more advanced monographs on the 
subject. It is also important to note that despite a number of speculative efforts to create 
the self-consistent and empirically supported cosmological quantum theory, a lot of work 
remains to fulfill that goal and several fundamental questions still remain unresolved. 
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Cosmological 
principle, 214 
red shifts, 223 
scale, 213 
Cosmology, 2 
Newtonian, 245 
Covariant derivatives 
commutator of, 88 
of contravariant vector, 58 
of covariant vector, 58 
properties of, 61 
of tensors, 60 
Covariant index, 58 
Covariant tensors, 23 
Covariant vectors, 21, 22 
differentials of, 58 
Curl of vectors, 72 
Current density vector, 133 
Curvature constant, 220, 225 
Curvature tensor 
alternative expression for, 87 
components of, 92 
covariant, 87, 90, 92 
cyclic identity, 87 
definition of, 85 
mixed, 87 
properties of, 87 

D 

Dark matter, 242 

Deceleration parameter, 223, 233 

Degeneracy, 52 

5-symbol, 12 

Density 

charge density, 133 
critical/ closure matter 
density, 238 

Lagrangian density 166 
Descartes coordinates 
inversions of, 36 
mixed product, 33 
rotations of, 34/, 35 
translations of, 36 
vector product, 32 
Determinant of metric tensor, 39 


Differential operators 
curl of vectors, 72 

divergence of vectors and tensors, 71 
gradient of scalar function, 70 
Hamiltonian V-operator, 69 
Laplacian of scalars and tensors, 73 
Direct product of systems, 8 
Divergence 
of tensors, 71 
of vectors, 71 

Doppler shift, 213. See also Cosmological, 
red shifts 

Dummy indices, 10 

E 

Eigenvalues of linear operator, 49 
Eigenvectors of linear operator, 49 
Einstein-de Sitter solution, 235 
Einstein field equations, 169 
linearized, 202 

Electric-charge conservation law, 133 
Electric field vector, 123 
Electric scalar potential, 122 
Electromagnetic 
current four-vector, 136, 138 
current vector, 133 
field invariants, 139 
Electromagnetic field 
potential of, 122 
tensor, 125 

Electromagnetic field equations 
charge density, 133 
electromagnetic current vector, 133 
electromagnetic potentials, 138 
Maxwell equations, 137 
Electromagnetic potentials, 138 
Elements /coordinates of the system, 6 
Energy 

rest energy, 113 
total energy, 113 
Energy-momentum tensor 
of cosmic fluid, 235 
electromagnetic field, 163 
of electromagnetic field, 143 
Lagrangian density, 166 
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of mass distribution, 166 
matter fields, 163 
Energy-momentum vector, 115 
conservation of, 118 
transformations of, 117 
Epoch 

matter- dominated epoch, 235 
Planck epoch, 247, 251 
vacuum-dominated epoch, 251 
Equation of motion 
of dust particle, 245 
of free particle, 112 
temporal, 193 

e- symbol in three dimensions, 31 
e-system, 12 

Euclidean metric spaces, 15, 77 
Events 
horizon, 193 

interval between events, 99 

F 

Field equations of physics, 156 

First-order system, 6 

Flux 

of electric charges, 133 
of mass, 139 

Four- acceleration, of particle, 103 
Fourth-order mixed tensor, 24 
Four- velocity, 103 
Free vector, 13 

Friedmann equations, 233-235, 237 
for closed model, 239-240 
critical density, 237-238 
deceleration parameter, 237-238 
for flat model, 238-239 
Hubble constant, 237-238 
for open model, 240-241 
Fundamental forces of nature, 1 
Fundamental observer, 215 

G 

Galilei transformations, 100 
Gauge 

condition, 125-126 
harmonic condition, 202 


invariance, 126 
Lorentz, 125-126 
of the theory, 125-126 
transverse traceless gauge, 205 
Gauss curvature of unit sphere, 91 
Gauss theorem, 75 
General theory of relativity, 1, 2 
local inertial frames of reference, 148 
time intervals and distances, 148, 149 
Geodesic differential equations, 77 
Geodesic equations, 82, 190, 193 
Geodesic lines, 77 
Geometry 

pseudo-Euclidean, 147 
pseudo-Riemannian, 147 
Gradient of scalar function, 70 
Gravitational constant, 153, 158 
Gravitational field equations 
action integral, 161 
Einstein field equations, 169 
matter fields, 166 
Newton law, 172 
non-relativistic limit of, 172 
Schwarzschild solution, 182 
Gravitational fields 
electromagnetic field equations, 156 
gravitation field potentials, 147 
particle dynamics, 153 
time intervals and distances, 150 
Gravitational potential, 153, 172 
Gravitational radius, of body, 182 
Gravitational waves 
linearized Einstein field equations, 202 
plane-wave solutions, 205 
on point particles, 207 
radiation, generation of, 210 

H 

Hamiltonian 
covariant operator, 69, 70 
V-operator, 69 
variational principle, 77 
Harmonic gauge condition, 202 
Hubble constant, 214, 221 
Hubble law, 214, 221 
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Indices 

dummy indices, 10 
lower indices, 6 
upper indices, 6 
Inertial frame of reference, 147 
Inertial systems of reference, 95 
Infinitesimal arc length element, 44 
Integral theorems, for tensor fields 
Gauss theorem, 75 
Stokes theorem, 75 
Interaction Lagrangian density, 133 
Interactions 
gravitational, 147 
maximum speed of, 95, 96 
Invariance principle, speed of light, 96 
Invariant Lagrangian density 
function, 136 

of gravitational field, 157, 158 
Invariants, 22 

and Lorentz transformations, 129 
Inversions, of Descartes coordinates, 36 

J 

facobian 

facobi theorem, 75 
of transformation, 29, 30, 32, 36 

K 

Kinetic 
energy, 113 

energy-momentum tensor, 120 
Klein-Gordon wave equation, 247 

L 

Lagrange equations, 79, 113 
Lagrange function, 77 
Lagrangian, 77 
of charged particle, 122 
equation, 190 
of a planet, 190 
Laplacian 
of scalars, 73 
of tensors, 73 


Length 

contraction, 108 
proper length, 106 
Lifespan, of the universe, 240 
Light 
cones, 193 
speed of light, 96 
Linear algebra, 6 

Linearized Einstein field equations, 202 
Linear operator, 49 
eigenvalues of, 49 
eigenvectors of, 49 
main directions of, 51 
secular equation, 49 
Local inertial frames of reference, 148 
Lorentz force, 123-124 
Lorentz gauge, 125-126 
Lorentz transformations, 102 
and invariants, 129 

M 

Magnetic induction vector, 123 
Magnetic vector potential, 122 
Main directions, of linear operator, 51 
Mass conservation law, 166 
Mass moments 
dipole, 210 
monopole, 210 
quadrupole, 210 
Mathematical space, 13 
axioms of, 13 
Matrix 

adjunct matrix, 41 
co-factors of, 41, 98 
column matrix, 6 
determinant of, 12 
transposed matrix, 41 
unit matrix, 41 
Matter fields, 166 
Maxwell equations, 137 
Maxwell stress tensor, 143 
Metric 

definition, 17 

Robertson- Walker metric, 220 
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Metric space 
Euclidean, 15 
pseudo-Euclidean, 98, 99 
Riemannian, 44 
Metric tensor, 157 
and Christoffel symbols, 68 
contravariant, 41 
definition, 39 
determinant of, 39 
Mini-black holes, 195 
Mini-superspace, 248 
Mixed tensors, 24 

Mixed vector product, in three dimensions, 33 

N 

IV -dimensional vector space. See Vector spaces 
Newtonian 
cosmology, 245 
dust, 245 
Newton law, 172 
non-relativistic limit, 172 
theory of gravitation, 190 
Newton law, 172 
Non-relativistic limit 
of action integral, 113 
of gravitational field equations, 172 
of kinetic energy, 113 
Lagrangian, 113 
of momentum, 113 
Newtonian, 172 

Non-relativistic mechanics, Galilei 
transformations of, 100 
Non-static models of the universe, 237 
Friedmann equations, 237-241 

O 

Operations with systems 
addition and subtraction, 8 
composition, 9 
contraction, 8 
direct product, 8 
Origin of coordinates, 14 
Orthogonal coordinate transformations, 37 
Orthonormal eigenvectors, 51 


P 

Particle dynamics, 153 
Perihelion 
advance, 190 
of a planet, 190 
shift, 190 

Perturbation method, 190 

Physical coordinates, of velocity vector, 47 

Planck epoch, 247, 251 

Planck length, 252 

Plane, in vector spaces, 15 

Point, 13 

Polarization 

of gravitational plane waves, 205 
tensors, 205 
Polar vectors, 36, 37 
Potentials 
electric scalar, 122 
electromagnetic, 138 
magnetic vector, 122 

Present-epoch deceleration parameter, 238, 
239, 240, 242 

Proper length, of object, 106 
Proper time interval, 105 
Pseudo-Euclidean metric tensor, 148 
Pseudo-Euclidean space, 19 
Pseudo-Riemannian space, 19 
Pseudoscalars, 37 

Pseudotensors. See Relative tensors 

Q 

Quadrupole mass moment, 210 
Quadrupole radiation, 210 
Quantum cosmology, 247, 253. See also 
Universe 

R 

Real IV- dimensional space, 13 
Relative tensors, 29 
Relativistic addition of velocities, 109 
Relativistic astrophysics, 2 
Relativistic kinematics 
interval between events, 99 
relativity, principle of, 95 
speed of light, invariance principle of, 96 
velocity and acceleration vectors, 103 



262 Index 


Relativistic phenomena 
length contraction, 108 
relativistic addition of velocities, 109 
time dilatation, 106 
twin paradox, 110 
Ricci antisymmetric tensors, 41 
Ricci scalar, 89, 158, 161 
Ricci tensor, 89, 172, 182, 233 
Riemannian spaces, 19, 77 
Robertson- Walker metric 
contravariant metric tensor for, 233 
covariant metric tensor for, 233 
definition and properties, 220 
Einstein tensor components in, 233 
Rotations, of Descartes coordinates, 

34/, 35 

S 

Scalar product, 15 
Scalars 

Laplacian of, 73 
pseudoscalars, 37 
scalar product, 15 
Scale radius, 220, 225, 233 
Schwarz inequality, 46 
Schwarzschild solution, 182 
Schwarzschild space-time metric, 182 
Second-order contravariant tensor, 23 
Second-order covariant tensor, 23 
Second-order system, 6 
Second-order tensors, 49 
Secular equation, 49 
Simple system. See First-order system 
Space character, 85 
Space -time 
coordinates, 96, 98 
geometry, 247, 253 
metric, 215, 220 
Space-time metric, 147 
Special relativity, 1 
Special theory of relativity 
electromagnetic field tensor, 155 
particle dynamics, 153 
time intervals and distances, 150 
Speed of light, invariance principle of, 96 


Stokes theorem, 75 
Straight line, in vector spaces, 15 
Substitution operator. See 8 -symbol 
Summation convention, 10 
Superspace, 247 
Surface vector, 75 
Symmetric system, 7 
Symmetric tensor, 24, 25 
Systems 

arbitrary order systems, 7 
first-order system, 6 
second-order system, 6 
third-order systems, 6 

T 

Tangent unit vector, 44 
Tensor calculus, 1 
lower and upper indices, 6 
Tensors. See also Linear operator 
absolute derivatives of, 62 
antisymmetric and symmetric parts of, 26 
character of systems, 28 
contravariant tensors, 23 
covariant derivatives of, 60 
covariant tensors, 23 
definition of, 21 
differentials of, 58 
divergence of, 71 
Laplacian of, 73 
mixed tensors, 24 
perturbation, 198 
polarization, 205 

relative tensors ( see Relative tensors) 
symmetry properties of, 25 
tensor capacities, 29 
tensor densities, 29 
trace-reverse tensor, 202 
Third-order contravariant tensor, 23 
Third-order covariant tensor, 23 
Third-order systems, 6 

Three-dimensional momentum conservation 
law, 118 

Time 

dilatation, 106 
proper time interval, 105 
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synchronized, 105, 107 
intervals and distances, 150 
Total energy, of particle, 113 
Trace-reverse tensor, 202 
Transformation laws 
of arbitrary tensor, 3 1 
of Christoffel symbols, 65 
of electric field vector, 128 
Transformations 
of coordinates, 22 
of energy- momentum vector, 117 
Galilei transformations, 100 
of variables, 21 

Translations, of Descartes coordinates, 36 
Transverse traceless (TT) gauge, 205 
Twin paradox, 110 

U 

Unit antisymmetric system. See e-system 
Unit antisymmetric tensors, 31 
Unit matrix, 41 

Unit sphere, Gauss curvature of, 91 
Unit symmetric system. See 5-symbol 
Unit systems 
5-symbol, 12 
e-system, 12 
Unit vector, 44 
Universe 

closed or open, 242 
critical/closure matter density, 238 
dark matter in, 242 
expansion of, 225, 235 
large-scale structure, 213, 214 
lifespan, 240 
matter-dominated, 214 
matter- dominated epoch, 235 
missing mass problem, 242 
non-static models of, 225 
quantum evolution of, 253 
radiation-dominated, 214, 235 
tunneling probability for, 248, 251, 252 
wave function of, 253 


V 

Vacuum- dominated epoch, 251 
Variational calculus, 77 
Vector product, in three dimensions, 32 
Vectors 

angles between vectors, 45 
axial vectors, 37 
covariant derivatives of, 58 
curl of, 72 
divergence of, 71 
electric field vector, 123 
magnetic induction vector, 123 
polar vectors, 36, 37 
surface vector, 75 
tangent unit vector, 44 
unit vector, 44 
Vector spaces 
definition of, 15 
in N dimensions, 14 
plane in, 15 
straight line in, 15 
Velocity 

and acceleration vectors, 103 
four- velocity, 103 
relativistic addition of, 109 
Velocity vector 

contravariant coordinates of, 47 
physical coordinates of, 47 
in spherical coordinates, 47 

W 

Waves 

plane-wave, 205 
polarization, 205 
vector, 205 

Wheeler-De Witt equation, 247, 251 
WKB approximation, 252 
World, 96 
horizon, 214 
line, 96 
point, 96 



